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*g (54) Title: ENHANCED ACCUMULATION OF CAROHENOIDS IN PLANTS 

(57) Abstract: The present invention relates to polynucleotides and their use in methods of increasing the carotenoid content of 
seeds. In particular the invention provides a polynucleotide comprising: (a) a region which comprises as opcrably linked components 
(i) a promoter which provides for seed preferred expression; and (ii) a nucleotide sequence derived from a bacterium which sequence 
encodes a carotene desaturase; and (iii) a transcription termination region; and (b) a further region which comprises as operably linked 
components (i) a promoter which provides for seed preferred expression; and (ii) a nucleotide sequence encoding a phytoene synthase 
which sequence is derived from maize (Zea sp.) or rice (Orzya sp.); and (iii) a transcription termination region. The disclosed 
polynucleotides are particularly suitable for use in production of rice seed which comprise high amounts of coloured carotenoids. 
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ENHANCED ACCUMULATION OF CAROTENOIDS IN PLANTS 

The present invention relates inter alia, to recombinant DMA technology. More 
specifically the invention relates to the provision of improved polynucleotides which 
provide for an enhanced accumulation of carotenoids in plants and in particular in the 
seeds of said plants. The invention also provides plant material, plants and seeds which 
comprise the polynucleotides, in particular rice plant material, rice plants and rice seeds. 

Carotenoids are 40-carbon (C 40 ) isoprenoids formed by condensation of eight 
isoprene units derived from the biosynthetic precursor isopentenyl diphosphate. By 
nomenclature, carotenoids basically fall into two classes, namely, carotenes and 
xanthophylls. Their essential function in plants is to protect against photo-oxidative 
damage in the photosynthetic apparatus of plastids. In addition to this, carotenoids 
participate in light harvesting during photosynthesis and represent integral components of 
photosynthetic reaction centres. Carotenoids are the direct precursors of the 
phytohormone abscisic acid. A part of the carotenoid biosynthesis pathway is shown in 
Figure 1. 

Carotenoids with provitamin A activity are essential components of the human 
diet. Additionally, (here is compelling evidence suggesting that a diet rich in carotenoids 
can prevent a number of serious medical conditions from developing, including certain 
cancers (especially lung and prostate), macular degeneration, cateract and cardiovascular 
disease. Carotenoids have also been reported to have immunomodulatory effects, such as 
the reduction in UV-induced immunosuppression. Carotenoids are able to act as efficient 
quenchers of harmful reactive oxygen species such as singletoxygen and thereby have 
antioxidant properties. 

With the population of the world increasing, there remains a need for the 
production of foods which are high in nutrition, healthy, tasty and visually appealing. 

The general insertion of genes involved in the carotenoid biosynthesis pathway 
into plants is disclosed in WOOO/53768. US6.429.356 describes methods for the 
production of plants and seeds having altered fatty acid, tocopherol and carotenoid 
compositions via insertion of a crtB gene. The present invention provides, inter alia, 
improved polynucleotides which when inserted into plant material provide for a 
surprisingly high accumulation of carotenoids in at least the seeds of plants derived from 
said material. More specifically, the present invention provides particular combinations 
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of nucleotide sequences which, when expressed in plant material provide for a 
surprisingly high accumulation of carotenoids in at least the seeds of plants derived from 
said material. 

According to the present invention there is provided a polynucleotide comprising: 

(a) a region which comprises as operably linked components (i) a promoter which 
provides for seed preferred expression; and (ii) a nucleotide sequence derived from a 
bacterium which sequence encodes a carotene desaturase; and (iii) a transcription 
termination region; and (b) a further region which comprises as operably linked 
components (i) a promoter which provides for seed preferred expression; and (ii) a 
nucleotide sequence encoding a phytoene synthase which sequence is derived from maize 
(Zea sp.) or rice (Orzya sp.); and (iii) a transcription termination region. 

The invention further provides a polynucleotide comprising: (a) a region which 
comprises as operably linked components (i) a promoter which provides for seed 
preferred expression; and (ii) a nucleotide sequence derived from a bacterium which 
sequence encodes a carotene desaturase; and (iii) a transcription termination region; and 

(b) a further region which comprises as operably linked components (i) a promoter which 
provides for seed preferred expression; and (ii) a nucleotide sequence encoding a 
phytoene synthase which sequence is derived from tomato (Ly copers icon sp.) or pepper 
{Capsicum sp.) or a bacterium; and (iii) a transcription termination region. 

In a further embodiment said carotene desaturase is derivable from Streptomyces; 
Staphylococcus; Synechocystis; Rhodobacter; Paracoccus; Erwinia; and 
Xanthophyllomyces. In a further embodiment said carotene desaturase is a phytoene 
desaturase. In a further embodiment when said phytoene synthase is derivable from a 
bacterium, it is derivable from Streptomyces; Staphylococcus; Synechocystis; 
Rhodobacter; Paracoccus; Erwinia; and Xanthophyllomyces. In a further embodiment of 
the invention said phytoene synthase is obtainable from Zea mays or Orzya sativa. In a 
still further embodiment of the invention said phytoene synthase is obtainable from 
Lycopersicon esculentwn or Capsicum anuum. 

In a further embodiment said phytoene synthase is obtained from Zea mays or 
Orzy>a sativa. In a still further embodiment said phytoene synthase comprises, is 
comprised by or consists of the sequence selected from the group depicted as SEQ ED 
NOS: 10; 11; 12; and 13. In a further embodiment said phytoene synthase comprises, is 
comprised by or consists of a sequence which encodes the protein depicted as SEQ ID 
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NO 14. Alternative phytoene synthase encoding sequences may also be available from 
databases known to the person skilled in the art. For example, the sequences depicted in 
the EMBL Database under Accession Numbers - AY024351, AK073290, AK 103 154 
andAY073162. 

> In a still further embodiment said phytoene synthase is derived, obtainable or 

obtained from Lycopersicon esculentum or Capsicum anuum. In a still further 
embodiment said phytoene synthase comprises, is comprised by or consists of SEQ ID 
NO: 15 or SEQ ID NO: 16. 

In a still &rther embodiment said phytoene synthase is derived, obtainable or 
obtained from the CrtB gene from bacteria. In a particular embodiment said crtB 
comprises, is comprised by or consists of the crtB sequence depicted in Shewmaker et al. 
(1999) Plant J. 20:401-12. In a further embodiment of the invention the sequence which 
encodes a carotene desaturase and/or the sequence encoding a phytoene synthase from a 
bacteria is derived from Erwinia sp., more specifically, Erwinia uredovora. In a still 
further embodiment the carotene desaturase is derived from the CrtI gene from Erwinia 
uredovora. In a still further embodiment the carotene desaturase is the CrtI gene from 
Erwinia uredovora. In a still further embodiment the carotene desaturase comprises, is 
comprised by or consists of the sequence depicted as SEQ TD NO: 18 or SEQ ID NO: 19. 

The present invention further provides a polynucleotide as described above 
wherein said promoter is selected from the Glutelin 1 promoter and the Prolamin 
promoter and said transcription termination region is selected from the Nos; CaMV 35S 
and PotPl-n transcription termination regions. Said Glutelin 1 and Prolamin promoters 
may be isolated from rice. In a particular embodiment said promoter comprises the 
sequence depicted as SEQ ID NO: 20 or 21. Further promoters include the promoter 
derived from the napin gene from Brassica napus and other promoters which are derived 
from genes normally expressed in the endosperm of the seed. Further transcription 
termination regions include the terminator region of a gene of alpha-tubulin (EP-A 
652,286). It is equally possible to use, in association with the promoter regulation 
sequence, other regulation sequences which are situated between the promoter and the 
sequence encoding the protein, such as transcriptional or translation^ enhancers, for 
example, tobacco etch virus (TEV) translation activator described in International Patent 
application, PCT publication number WO87/07644. 
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Thepresem invention still tamer provides a polynucleotide as desoribed above 
wher ein ,he sequence wbich encodes carotene desaturase and the sequence whrch 
Lodes phyloene synmase forth, eonrprises a plastid targeting sequence, fa a particuta 

carboxylase small-aubuni, (RUBISCO Sao) from Pisu,n scUi^n. h . W» 
embodinren, said RUBISCO Ssu piasod targeting sequence is located 5,o the 
nanslationa, star, point of said carotene desato.se gene denved from Oft hr a ul. 
ft^er embodimen, said RUBISCO Ssu plastid ,arge,mg sequence „ laoa.eti tothe 
National slart point of aaid phymene symhase gene derived from CrtB. In ***** 
Tbodimen, aaid plastid targeting sequence is beteroiogous witb respect to sard pbytoene 
U-and/orsaid — desatau*. In a still tatter embodiment sard, lastid 
Tgeting sequence is autoiogous with respect to said pbyroenc synthase and/or sard 
carotene deaataase. By ■■beteroiogous" is mean, from a differen, source, and 
correspondingly "autologous" means from me same source - bot a. a gene rather man 
lanLortissuelevel. In a still former embodimen, me plastid targehng sequence 

_ associated with Or. pbytoene synthase is autologous therewtih. In a S tr> tather 

ta "Tpt:: — .Ufbrrherprovrdesapol^eotideasdeactihedahove 
serein the sequence which encodes said carotene desaturase and/or the sequence whrc 
rrdesaaidpnytoenesy.maseformercomprisesanintion. In a particular embodrnren, 
said intion region is .coaled between me promoter region and the regton encodmg the 
carotene desahrrase/phytoene synthase, h, a tamer embodiment said ration regron, 
.ocated between me promoter region and tire plastid targeting sequence. In a ,.11 tamer 
embodiment said mtion region is located upstream of said caro.ene dessturase and/or 
" pbytoene synmase encoding sequence,, In a still tather embodimen. sard mtion rs 
derived bom .he intion of.be first gene torn me ca„ase gene of the castor bean pi*,.. 
In . m tamer embodimen, said innon is fiom ,he firs, intion of me catalase gene tiom 
Ore casta bean plant. In a still tather embodimen, said intion comprises the sequence 
depicted as SEQ ID NO: 22. hr a stil, further embodimen. the intion rs the mtion of me 
maize polyubiquitin gene. 
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Thepresen, bvmtim iurmerprovides apoiynucleotide as described above 
wherein said seance e „ codii , s tar01en£ ^ ^ f , ^ ^ 

encoding phytoene synthase. 

^P-^inventionfurtherprovidesapolynucleotideasd^ 
where, sa,d sequence encoding phytoene synthase is located 5» to said sequent 
encoding carotene desaturase. 

which composes die sequence seto ed ^ fc ^ ^ _ SEQ ffi ^ 

* 5 ^«^P^*e mlMfcra(<)ffteillve „ tiolllllepoly]iucltoti . (i e ^'- 
.0 ^^ tomthegroiTdepictedasSEQIDNO:1;2;3;4;5aiid6]na 

NO: ,,.«, n^bommen.ofmemv.nnon ^^^^^ 
cons 1St5 „ fSEQroNO:2 . basimtothcrembodimentofteinvaiaMte P 

Po'™ e oo mpris es„ r co MisB „ f SEQmN0..3. Innsffl^^,^ 

which comprises or consists of a sequence selected from tn ., . 

NOS: 7; 8; and 9. 8r °" P M SB< 3 10 

tho eompiemen, of one which hybridises to a poiynueieotide as described in the 
preceduig paragraph a, a temper of aboo, 65°C in a solution c „„ taining 6 x ssc 
0.0,,. SDS and 0,5 % dunned mi,, po wte , &Uowed by ^ „ fc ^ ^ 
temperature in a solution containing 0.2 x SSC and 0. 1% SDS wherein said 

further re^ encoding , pbyt ^ ^ ^ ^ ^ 
,nser,edm,„ plmlmateal0leradofaptatregmmted ^^ dmaiHi 
tnereased amoun, of camtenoida when compared to a control liWed. lUeaKilled 

a. a temperahne of between 60X and o5X in 0.3 strength citrate buffered sauna 
-WnmgO.mSDS Mowed byt^htg „ (he same temper wim 0.3 strength 
nitrate buffered saline containing 0. 1% SDS Mowed by confirmation ma, when the 
polynucleotide sequence so identified is inserted ta pIant maKria , ^ ^ 
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regenerated from said material produce an increased amount of carotenoids when 
compared to a control like-seed. The person skilled in the art may also select further 
hybridisation conditions that are equally understood to be "high stringency" conditions, 
hi a particular embodiment of the present invention when said polynucleotide sequence is 
inserted into plant material, the seed of a plant regenerated from said material produces at 
least a sixty fold increase in carotenoids when compared to a control like-seed. In a 
further embodiment of the invention when said polynucleotide sequence is inserted into 
plant material, the seed of a plant regenerated from said material produces at least a one 
hundred fold increase in carotenoids when compared to a control like-seed. In a still 
further embodiment of the invention when said polynucleotide sequence is inserted into 
plant material, the seed of a plant regenerated from said material produces at least a one 
hundred and fifty fold increase in carotenoids when compared to a control like-seed. In a 
still further embodiment of the invention when said polynucleotide sequence is inserted 
into plant material, the seed of a plant regenerated from said material produces at least a 
two hundred fold increase in carotenoids when compared to a control like-seed. In a still 
further embodiment of the invention when said polynucleotide sequence is inserted into 
plant material, the seed of a plant regenerated from said material produces at least a two 
hundred and fifty fold increase in carotenoids when compared to a control like-seed. In a 
still further embodiment of the invention when said polynucleotide sequence is inserted 
into plant material, the seed of a plant regenerated from said material produces at least a 
three hundred fold increase in carotenoids when compared to a control like-seed. In a 
still further embodiment of the invention when said polynucleotide sequence is inserted 
into plant material, the seed of a plant regenerated from said material produces at least a 
three hundred and fifty fold increase in carotenoids when compared to a control like- 
seed. In a still further embodiment of the invention when said polynucleotide sequence is 
inserted into plant material, the seed of a plant regenerated from said material produces at 
least a four hundred fold increase in carotenoids when compared to a control like-seed. 
In a still further embodiment of the invention when said polynucleotide sequence is 
inserted into plant material, the seed of a plant regenerated from said material produces at 
least a five hundred fold increase in carotenoids when compared to a control like-seed. 

The term control like-seed relates to seeds which are substantially similar to those 
according to the invention but which control like-seed does not contain the 
polynucleotides or polynucleotide sequences according to the invention. Typically, a 
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control like-seed will comprise a seed of the same or similar plant species which control 
like-seed has not been transformed. The increased carotenoid content of the seeds 
comprising the polynucleotides or polynucleotide sequences according to the invention 
may a iso be demonstrated via comparison of said seeds with seeds that comprise the 
TDNA depicted in Plasmid A of Figure 4 of WO00/53768, wherein the phytoene 
synthase (psy) is from daffodil (Narcissus pseudonarcissus). Typically, such a 
comparison would be made when the seed to be compared are of the same or a 
substantially similar plant species. In a particular embodiment the seed comprising the 
polynucleotides or polynucleotide sequences according to the invention contain at least 
three times the amount of carotenoids when compared to a seed that comprise the TDNA 
depicted in Plasmid A of Figure 4 of WOOO/53768, wherein the phytoene synthase (psy) 
is from daffodil (Narcissus pseudonarcissus). In a still further embodiment the seed 
comprising the polynucleotides or polynucleotide sequences according to the invention 
contains at least four times, or at least five times, or at least six times, or at least seven 
times, or at least eight times or at least nine times or at least ten times or at least fifteen 
times or at least twenty times or at least thirty times, or at least forty times or at least fifty 
times the amount of carotenoids when compared to a seed that comprise the TDNA 
depicted in Plasmid A of Figure 4 of WOOO/53768, wherein the phytoene synthase (psy) 
is from daffodil (Narcissus pseudonarcissus). 

The present invention still further provides a polynucleotide sequence as 
described above wherein when said polynucleotide sequence is inserted into plant 
material, the seed of a plant regenerated from said material produces carotenoids at a 
level of at least 3ug/g of endosperm of said seed. In a further embodiment when said 
polynucleotide sequence as described above is inserted into plant material, the seed of a 
plant regenerated from said material produces carotenoids at a level of at least 4ng/g of 
endosperm of said seed. In a further embodiment when said polynucleotide sequence as 
described above is inserted into plant material, the seed of aplant regenerated from said 
material produces carotenoids at a level of at least 5pg/g of endosperm of said seed. In a 
still further embodiment when said polynucleotide sequence as described above is 
inserted into plant material, the seed of a plant regenerated from said material produces 
carotenoids at a level of at least 6ug/g of endosperm of said seed In a still further 
embodiment when said polynucleotide sequence as described above is inserted into plant 
material, the seed of a plant regenerated from said material produces carotenoids at a 
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level of at least 7jig/g of endosperm of said seed. In a still further embodiment when said 
polynucleotide sequence as described above is inserted into plant material, the seed of a 
plant regenerated from said material produces carotenoids at a level of at least 3p,g/g of 
endosperm of said seed. In a still further embodiment when said polynucleotide 
sequence as described above is inserted into plant material, the seed of a plant 
regenerated from said material produces carotenoids at a level of at least 9fig/g of 
endosperm of said seed. In a still further embodiment when said polynucleotide 
sequence as described above is inserted into plant material, the seed of a plant 
regenerated from said material produces carotenoids at a level of at least lOfig/g of 
endosperm of said seed. In a still further embodiment when said polynucleotide 
sequence as described above is inserted into plant material, the seed of a plant 
regenerated from said material produces carotenoids at a level of at least 1 1 j^g/g of 
endosperm of said seed. In a still further embodiment when said polynucleotide 
sequence as described above is inserted into plant material, the seed of a plant 
regenerated from said material produces carotenoids at a level of at least 12jug/g of 
endosperm of said seed. In a still further embodiment when said polynucleotide 
sequence as described above is inserted into plant material, the seed of a plant 
- regenerated from said material produces carotenoids at a level of at least 13jag/g of : 
endosperm of said seed. In a still further embodiment when said polynucleotide 
sequence as described above is inserted into plant material, the seed of a plant 
regenerated from said material produces carotenoids at a level of at least 14p,g/g of 
endosperm of said seed. In a still further embodiment when said polynucleotide 
sequence as described above is inserted into plant material, the seed of a plant 
regenerated from said material produces carotenoids at a level of at least 15p,g/g of 
endosperm of said seed. In a still further embodiment when said polynucleotide 
sequence as described above is inserted into plant material, the seed of a plant 
regenerated from said material produces carotenoids at a level of at least 20p.g/g of 
endosperm of said seed. In a still further embodiment when said polynucleotide sequence 
as described above is inserted into plant material, the seed of a plant regenerated from 
said material produces carotenoids at a level of at least 25/J.g/g of endosperm of said seed. 
In a still further embodiment when said polynucleotide sequence as described above is 
inserted into plant material, the seed of a plant regenerated from said material produces 
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carotenoids at a level of at least 30ug/g of endosperm of said seed. In a still further 
embodiment when said polynucleotide sequence as described above is inserted into plant 
material, the seed of a plant regenerated from said material produces carotenoids at a 
level of at least 35ug/g of endosperm of said seed. In a still further embodiment when 
said polynucleotide sequence as described above is inserted into plant material, the seed 
of a plant regenerated from said material produces carotenoids at a level of at least 
40p.g/g of endosperm of said seed. In a still further embodiment when said 
polynucleotide sequence as described above is inserted into plant material, the seed of a 
plant regenerated from said material produces carotenoids at a level of at least 45p.g/g of 
endosperm of said seed. In a still further embodiment when said polynucleotide sequence 
as described above is inserted into plant material, the seed of aplant regenerated from 
said material produces carotenoids at a level of at least 50ug/g of endosperm of said seed. 
In a still further embodiment when said polynucleotide sequence as described above is 
inserted into plant material, the seed of a plant regenerated from said material produces 
carotenoids at a level of at least 55ug/g of endosperm of said seed. In a still further 
embodiment when said polynucleotide sequence as described above is inserted into plant 
material, the seed of a plant regenerated from said material produces carotenoids at a 
level of at least 60>g/g of endosperm of said seed. In a still further embodiment when 
said polynucleotide sequence as described above is inserted into plant material, the seed 
of a plant regenerated from said material produces carotenoids at a level of at least 
65ug/g of endosperm of said seed. In a particular embodiment the amount of carotenoids 
is calcinated as ng/g of dry weight of endosperm of said seed. 

The present invention still further provides a polynucleotide sequence which is 
the complement of one which hybridises to a polynucleotide which consists of a 
sequence selected from the group depicted as SEQ ID NOS: 1; 2; 3; 4; 5; and 6 at a 
temperature of about 65°C in a solution containing 6 x SSC, 0.01% SDS and 0.25% 
skimmed milk powder, followed by rinsing at the same temperature in a solution 
containing 0.2 x SSC and 0.1% SDS wherein said polynucleotide sequence still 
comprises a region encoding a carotene desaturase and a further region encoding a 
phytoene synthase and when said polynucleotide sequence is inserted into plant material 
the seed of a plant regenerated from said material produce carotenoids amounting to at 
least 80% of the carotenoid content of a seed which comprises a polynucleotide selected 
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■^EQ ID NOS: 1; 2; 3; 4; 5 and 6. In a further embodiment, . 
^ \ence is inserted into plant material, the seed of a plant 
/^produce carotenoids amounting to at least 85% of the 
^dTseed which comprises a polynucleotide selected from the group 
^s'SEQ ID NOS: 1; 2; 3; 4; 5 and 6. In a still further embodiment when said 
"polynucleotide sequence is inserted into plant material, the seed of a plant regenerated 
from said material produce carotenoids amounting to at least 90% of the carotenoid 
content of a seed which comprises a polynucleotide selected from the group depicted as 
SEQ ID NOS: 1; 2; 3; 4; 5; and 6. In a still further embodiment when said 
10 polynucleotide sequence is inserted into plant material, the seed of a plant regenerated 
from said material produce carotenoids amounting to at least 95% of the carotenoid 
content of a seed which comprises a polynucleotide selected from the group depicted as 
SEQ ID NOS: 1 ; 2; 3; 4; 5 and 6. In a still further embodiment when said polynucleotide 
sequence is inserted into plant material, the seed of a plant regenerated from said material 
15 produce carotenoids amounting to at least 100% of the carotenoid content of a seed 

which comprises a polynucleotide selected from the group depicted as SEQ ID NOS: 1; 
2; 3; 4; 5 and 6. In a particular embodiment the polynucleotide sequence provides for a 
percentage of the carotenoid content of seed as described above wherein the seed with 
which the comparison is made comprises the polynucleotide depicted as SEQ ID NO: 1. 
20 In a particular embodiment the polynucleotide sequence provides for a percentage of the 
carotenoid content of seed as described above wherein the seed with which the 
comparison is made comprises the polynucleotide depicted as SEQ ID NO: 2. In a 
particular embodiment the polynucleotide sequence provides for a percentage of the 
carotenoid content of seed as described above wherein the seed with which the 
25 comparison is made comprises the polynucleotide depicted as SEQ ID NO: 3. In a 
particular embodiment the polynucleotide sequence provides for a percentage of the 
carotenoid content of seed as described above wherein the seed with which the 
comparison is made comprises the polynucleotide depicted as SEQ ID NO: 4. In a 
particular embodiment the polynucleotide sequence provides for a percentage of the 
30 carotenoid content of seed as described above ivherein the seed with which the 
comparison is made comprises the polynucleotide depicted as SEQ ID NO: 6. 

It is preferred that when the carotenoid content of the seed comprising said 
polynucleotide sequence is compared with the seed comprising the polynucleotide 
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selected from the group depicted as SEQ ID NOS: 1; 2; 3; 4; 5 and 6, the seed are I 
plants of substantially the same speeies. It is further preferred that when the carotenoid 
content of the seed comprising said polynucleotide sequence is compared with the seed 
comprising the polynucleotide selected from the group depicted as SEQ ID NOS: 1; 2; 3; 
4; 5 and 6, the seed are from plants which are substantially genetically identical save for 
the presence of said polynucleotide or said polynucleotide sequence. It is further 
preferred that when the carotenoid content of the seed comprising said polynucleotide 
sequence is compared with the seed comprising the polynucleotide selected from the 
group depicted as SEQ ID NOS: 1; 2; 3; 4; 5 and 6, the seed are from plants which are 
grown subject to the same environmental growing conditions. 

The present invention still further provides a polynucleotide sequence which is 
the complement of one which hybridises to a polynucleotide which consists of a 
sequence selected from the group depicted as SEQ ID NOS: 7; 8; and 9 at a temperature 
of about 65°C in a solution containing 6 x SSC, 0.01% SDS and 0.25% skimmed milk 
powder, followed by rinsing at the same temperature in a solution containing 0.2 x SSC 
and 0. 1% SDS wherein said polynucleotide sequence still comprises a region encoding a 
carotene desaturase and a further region encoding a phytoene synthase and when said 
polynucleotide sequence is inserted into plant material the seed of a plant regenerated 
from said material produce carotenoids amounting to at least 80% of the carotenoid 
content of a seed which comprises a polynucleotide selected from the group depicted as 
SEQ ID NOS: 7; 8 and 9. It is preferred that when the carotenoid content of the seed 
comprising said polynucleotide sequence is compared with the seed comprising the 
polynucleotide selected from the group depicted as SEQ ID NOS: 7; 8 and 9, the seed are 
from plants of substantially the same species. It is further preferred that when the 
carotenoid content of the seed comprising said polynucleotide sequence is compared with 
the seed comprising the polynucleotide selected from the group depicted as SEQ ID 
NOS: 7; 8 and 9, the seed are from plants which are substantially genetically identical 
save for the presence of said polynucleotide or said polynucleotide sequence. 

In a particular embodiment the polynucleotide sequences according to the 
invention which are identified based on their hybridisation (under the conditions 
provided) to the sequences described in the Sequence Listing, encode the same proteins 
as those provided by the sequences in the Sequence Listing. In a further embodiment, 
said polynucleotide sequences may encode proteins which have the same or a similar 
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function as the proteins encoded by the sequences in the Sequence Listing. In a still 
further embodiment, the proteins encoded. by the polynucleotide sequence according to 
the invention comprise amino acid substitutions and/or deletions when compared to the 
porteins encoded by the sequences in the Sequence Listing. In a still further 
embodiment, said amino acid substitutions are "conservative" substitutions. A 
"conservative" substitution is understood to mean that the amino acid is replaced with an 
amino acid with broadly similar chemical properties. In particular "conservative" 
substitutions may be made between amino acids within the following groups: (i) Alanine 
and Glycine; (ii) Threonine and Serine; (ii) Glutamic acid and Aspartic acid; (iii) 
Arginine and Lysine; (iv) Asparagine and Glutamine; (v) Isoleucine and Leucine; (vi) 
Valine and Methionine; and (vii) Phenylalanine and Tryptophan. 

The present invention still further provides a polynucleotide or a polynucleotide 
sequence as described above which when inserted into plant material, the seed of a plant 
regenerated from said material produces carotenoids at levels which are higher than those 
present in native like-seeds. The present invention still further provides a polynucleotide 
or a polynucleotide sequence as described above which when inserted into plant material, 
the seed of a plant regenerated from said material produces carotenoids at levels which 
are higher than those present in untransformed like-seeds. 

In a particular embodiment, the carotenoids which are increased are selected from 
the group consisting of: lycopene; alpha-carotene; lutein; beta-carotene; zeaxanthin; beta- 
cryptoxanthin; antheraxanthin; violaxanthin; and neoxanthin or a combination thereof. In 
a further embodiment, the carotenoids which are increased are selected from the group 
consisting of: lycopene; alpha-carotene; lutein; beta-carotene; zeaxanthin; beta- 
cryptoxanthin; or a combination thereof. In a still further embodiment, the carotenoids 
which are increased are selected from the group consisting of: alpha-carotene; lutein; 
beta-carotene; zeaxanthin; beta-cryptoxanthin; or a combination thereof. In a still further 
embodiment, the carotenoids which are increased include at least phytoene and beta- 
carotene. In a still further embodiment, the carotenoids which are increased include at 
least beta-carotene. In a still farther embodiment, the carotenoid which is increased is 
beta-carotene. In a still further embodiment the carotenoids which are increased are 
coloured carotenoids. 

The present invention still further provides a polynucleotide or a polynucleotide 
sequence as described above wherein said seed is a rice seed. In a particular embodiment 
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of the invention, before the seeds are analysed for their carotenoid content, the seeds are 
prepared prior to the analysis. Such preparation may include, for example, with respect 
to rice seed, "dehusking" and "polishing". Furthermore, such preparation may involve 
the removal of those plant parts associated with the seed which plant parts are not 
5 normally intended for human consumption. 

The amount of carotenoids in the seeds can be determined using techniques which 
are well known and available to the person skilled in the art. Such techniques include but 
are not necessarily limited to High Performance Liquid Chromatography (HPLC) 
analysis and spectrophotometry. 
1 0 The present invention still further provides a polynucleotide or polynucleotide 

sequence as described above which further comprises a region which encodes a 
selectable marker. In a particular embodiment said selectable marker comprises a 
mannose-6-phosphate isomerase gene. In a further particular embodiment the selectable 
marker used is the mannose-6-phosphate isomerase gene according to the Positech™ 
15 selection system. Li a specific embodiment said selectable marker is the one as described 
in European Patent/Application publication Number EP 0 896 063 and EP 0 601 092. 
Alternatively, the selectable marker used may, in particular, confer resistance to 
kanamycin, hygromycin or gentamycin. Further suitable selectable markers include 
genes that confer resistance to herbicides such as glyphosate-based herbicides (e.g. 
20 • EPSPS genes such as in USP 5510471 or WO 00/66748) or resistance to toxins such as 
eutypine. Other forms of selection are also available such as hormone based selection 
systems such as the Multi Auto Transformation (MAT) system of Hiroyrasu Ebinuma et 
al. 1997. PNAS Vol. 94 pp21 17-2121; visual selection systems which use fluorescent 
proteins, p glucoronidase and any other selection system such as xylose isomerase and 2- 
25 deoxyglucose (2-DOG). 

The present invention still further provides a polynucleotide, or a polynucleotide 
sequence according to the invention which is codon optimised for expression in a 
particular plant species. In a particular embodiment the polynucleotide or polynucleotide 
sequence is codon optimised for expression in rice (Orzya sp.) or maize (Zea sp.). Such 
30 codon optimisation is well known to the person skilled in the art and the table below 
provides an example of the plant-preferred codons for rice and maize. 
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Amino Acid 


ivice preierciiLe 


iviaize prLiercnce 


Alanine 


GCC 


GCC 


Arginine 


CGC 


AGG 


Asparagine 


AAC 


ACC 


Aspartic Acid 


GAC 


GAC 


Cysteine 


TGC 


TGC 


Glutamine 


CAG 


CAG 


Glutamic Acid 


GAG 


GAG 


Glycine 


GGC 


GGC 


Histidine 


CAC 


CAC 


Isoleucine 


ATC 


ATC 


Leucine 


CTC 


CTG 


Lysine 


AAG 


AAG 


Methionine 


ATG 


ATG 


Phenylalanine 


TTC 


TTC 


Proline 


CCG 


CCG 


Serine 


TCC 


AGC 


Threonine 


ACC 


ACC 


Tryptophan 


TGG 


TGG 


Tyrosine 


TAC 


TAC 


Valine 


GTG 


GTG 



The present invention further provides a vector comprising a polynucleotide or a 
polynucleotide sequence as described above. In a particular embodiment of the invention 
said vector comprises a polynucleotide selected from the group depicted as SEQ ID NO: 
5 1; 2; 3; 4; 5 and 6. In a particular embodiment of the invention said vector comprises a v 
polynucleotide selected from the group depicted as SEQ ID NOS: 7; 8 and 9. In a 
particular embodiment said vector allows for replication of said polynucleotide or 
polynucleotide sequence in a bacterium. 

The present invention still further provides a vector as described above which is a 
1 0 plant expression vector. 



WO 2004/085656 



PCT/GB2004/001241 



i 

■ \\ 



-15- 

In a particular embodiment of the invention the sequence around the translational 
start position(s) of the phytoene synthase and/or said carotene desaturase encoding 
sequences as described above may be modified such that it is "Kozak" preferred. What 
is meant by this is well known to the skilled artisan. Examples of Kozak consensus 
5 sequences which are v/ell known to the person skilled in the art include cagcc(atg) or 
agcc(atg). The phytoene synthase and/or said carotene desaturase encoding sequences as 
described above may also further comprise a sequence which provides for retention in a 
particular intracellular organelle. 

In a further aspect of the present invention there is provided a method for 

10 increasing the carotenoid content of seeds comprising inserting into plant material a 
polynucleotide or a polynucleotide sequence or a vector as described above; and 
regenerating a seed-containing plant from said material and identifying the seeds which 
contain carotenoids at levels greater that those of a control like-seed. 

The present invention still further provides a method for increasing the carotenoid 

15 content of seeds comprising inserting into plant material a polynucleotide comprising a 
sequence selected from the group depicted as SEQ ID NO: 1; 2; 3; 4; 5 and 6 and 
regenerating a seed containing plant from said material and identifying the seeds which 
contain carotenoids at levels greater that those of control like-seeds. In a particular 
embodiment of the invention, the seeds obtained by said method contain at least a sixty 

20 fold increase in carotenoids when compared to a control like-seed. In a further 
embodiment the seeds obtained by said method contain at least a one hundred fold 
increase in carotenoids when compared to control like-seeds. In a further embodiment the 
seeds obtained by said method contain at least a one hundred and fifty fold increase in 
carotenoids when compared to control like-seeds. In a further embodiment the seeds 

25 obtained by said method contain at least a two hundred fold increase in carotenoids when 
compared to control like-seeds. In a further embodiment the seeds obtained by said 
method contain at least a two hundred and fifty fold increase in carotenoids when 
compared to control like-seeds. In a further embodiment the seeds obtained by said 
method contain at least a three hundred fold increase in carotenoids when compared to 

30 control like-seeds. In a further embodiment the seeds obtained by said method contain at 
least a three hundred and fifty fold increase in carotenoids when compared to control 
like-seeds. In a further embodiment the seeds obtained by said method contain at least a 
four hundred fold increase in carotenoids when compared to control like-seeds. In a 



WO 2004/085656 



PCT/GB2004/001241 



-16- 

flirther embodiment the seeds obtained by said method contain at least a five hundred 
fold increase in carotenoids when compared to control like-seeds. 

The present invention still flulher provides a method as described above wherein 
said seed contains carotenoids at a level of at least 3^xg/g of endosperm of said seed. In a 
particular embodiment said seed contains carotenoids at a level of at least 4|ag/g of 
endosperm of said seed. In a particular embodiment said seed contains carotenoids at a 
level of at least 5|xg/g of endosperm of said seed. In a particular embodiment said seed 
contains carotenoids at a level of at least 6pg/g of endosperm of said seed. In a particular 
embodiment said seed contains carotenoids at a level of at least 7^ig/g of endosperm of 
said seed. In a particular embodiment said seed contains carotenoids at a level of at least 
8jag/g of endosperm of said seed. In a particular embodiment said seed contains 
carotenoids at a level of at least 9(iig/g of endosperm of said seed. In a particular 
embodiment said seed contains carotenoids at a level of at least lO^ig/g of endosperm of 
said seed. In a particular embodiment said seed contains carotenoids at a level of at least 
. 15|ig/g of endosperm of said seed. In a particular embodiment said seed contains 
carotenoids at a level of at least 20p,g/g of endosperm of said seed. In a particular 
embodiment said seed contains carotenoids at a level of at least 25jj.g/g of endosperm of 
said seed. In a particular embodiment said seed contains carotenoids at a level of at least 
30|ig/g of endosperm of said seed. In a particular embodiment said seed contains 
carotenoids at a level of at least 35[ig/g of endosperm of said seed. In a particular 
embodiment said seed contains carotenoids at a level of at least 40p.g/g of endosperm of 
said seed. In a particular embodiment said seed contains carotenoids at a level of at least 
45jig/g of endosperm of said seed. In a particular embodiment said seed contains 
carotenoids at a level of at least 50fxg/g of endosperm of said seed. In a particular 
embodiment said seed contains carotenoids at a level of at least 55p.g/g of endosperm of 
said seed. In a particular embodiment said seed contains carotenoids at a level of at least 
60}ig/g of endosperm of said seed. In a particular embodiment said seed contains 
carotenoids at a level of at least 65p.g/g of endosperm of said seed. 

The present invention still further provides a method for increasing the carotenoid 
content of seeds comprising inserting into plant material a polynucleotide sequence as 
described above and regenerating a seed-containing plant from said material and 
identifying the seed which contains carotenoids amounting to at least 80% of the 
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carotenoid content of a seed which comprises a polynucleotide selected from the group 
depicted as SEQ ID NOS: 1; 2; 3; 4; 5 and 6. In a further embodiment, the seed of a 
plant regenerated from said material produce carotenoids amounting to at least 85% of 
the carotenoid content of a seed which comprises a polynucleotide selected from the 
5 group depicted as SEQ ID NOS: 1 ; 2; 3; 4; 5 and 6. In a still further embodiment, the 
seed of a plant regenerated from said material produce carotenoids amounting to at least 
90% of the carotenoid content of a seed which comprises a polynucleotide selected from 
the group depicted as SEQ ID NOS: 1; 2; 3; 4; 5 and 6. In a still further embodiment, the 
seed of a plant regenerated from said material produce carotenoids amounting to at least 
10 95% of the carotenoid content of a seed which comprises a polynucleotide selected from 
the group depicted as SEQ ID NOS: 1; 2; 3; 4; 5 and 6. In a still further embodiment, the 
seed of a plant regenerated from said material produce carotenoids amounting to at least 
100% of the carotenoid content of a seed which comprises a polynucleotide selected from 
the group depicted as SEQ ID NOS: 1; 2; 3; 4; 5 and 6. In a particular embodiment the 
15 polynucleotide sequence provides for a percentage of the carotenoid content of seed as 
described above wherein the seed with which the comparison is made comprises the 
polynucleotide depicted as SEQ ID NO: 1. In a particular embodiment the 
polynucleotide sequence provides for a percentage of the carotenoid content of seed as 
described above wherein the seed with which the comparison is made comprises the 
20 polynucleotide depicted as SEQ ID NO: 2. In a particular embodiment the polynucleotide 
sequence provides for a percentage of the carotenoid content of seed as described above 
wherein the seed with which the comparison is made comprises the polynucleotide 
depicted as SEQ ID NO: 3. In a particular embodiment the polynucleotide sequence 
provides for a percentage of the carotenoid content of seed as described above wherein 
25 the seed with which the comparison is made comprises the polynucleotide depicted as 
SEQ ID NO: 4. In a particular embodiment the polynucleotide sequence provides for a 
percentage of the carotenoid content of seed as described above wherein the seed with 
which the comparison is made comprises the polynucleotide depicted as SEQ ID NO: 6. 
The present invention still further provides a method for increasing the carotenoid 
30 content of seeds comprising inserting into plant material a polynucleotide comprising a 
sequence selected from the group depicted as SEQ ID NOS: 7; 8 and 9 and regenerating 
a seed containing plant from said material and identifying the seeds which contain 
carotenoids at levels greater that those of control like-seeds. In a particular embodiment 
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of the invention, the seeds obtained by said method contain at least a fifty fold increase in 
carotenoids when compared to a control like-seed. 

The present invention still further provides a method as described above wherein 
said seed contains carotenoids at a level of at least 3p.g/g of endosperm of said seed. 
5 The present invention still further provides a method for increasing the carotenoid 

content of seeds comprising inserting into plant material a polynucleotide sequence as 
described above and regenerating a seed-containing plant from said material and 
identifying the seed which contains carotenoids amounting to at least 80% of the 
carotenoid content of a seed which comprises a polynucleotide selected from the group 
10 depicted as SEQ ID NOS: 7; 8 and 9. 

The present invention still further provides a method as described above wherein 
the carotenoids which are increased are selected from the group consisting of: lycopene; 
alpha-carotene; lutein; beta-carotene; zeaxanthin; beta-cryptoxanthin; antheraxanthin; 
violaxanthin; and neoxanthin or a combination thereof. In a further embodiment, the 
15 carotenoids which are increased are selected from the group consisting of: lycopene; 
alpha-carotene; lutein; beta-carotene; zeaxanthin; beta-cryptoxanthin; or a combination 
thereof. In a still further embodiment, the carotenoids which are increased are selected 
from the group consisting of: alpha-carotene; lutein; beta-carotene; zeaxanthin; beta- 
cryptoxanthin; or a combination thereof. In a still further embodiment, the carotenoids 
20 which are increased include at least phytoene and beta-carotene. In a still further 

embodiment, the carotenoids which are increased include at least beta-carotene. In a still 
further embodiment, the carotenoid which is increased is beta-carotene. In a still further 
embodiment the carotenoids which are increased are coloured carotenoids. 

The polynucleotide or polynucleotide sequence or vector asdescribed above may 
25 be inserted into plant material by plant transformation techniques that are well known to 
the person skilled in the art. Such techniques include but are not limited to particle 
mediated biolistic transformation, Agrobacterium-mediated transformation, protoplast 
transformation (optionally in the presence of polyethylene glycols); sonication of plant 
tissues, cells or protoplasts in a medium comprising the polynucleotide or vector; micro- 
30 insertion of the polynucleotide or vector into totipotent plant material (optionally 

employing the known silicon carbide "whiskers'* technique), electroporation and the like. 
In a particular embodiment of the invention rice plant material is transformed in 
accordance with the methods described in the Examples disclosed herein. In a particular 
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embodiment the Agrobacterium that is used is a strain that has been modified to reduce 
the possibility of recombination between sequences having a high degree of similarity 
within the TDNA region of the Agrobacterium. Furthermore, techniques and elements 
such as those referred to in WO99/01563, US6,265,633, US5,73 1,179 and US5,591,616 
5 may be employed as part of the transformation process. 

The present invention still further provides seed obtained or obtainable by a 
method as described above. In a specific embodiment said seed are rice seed. In a 
further embodiment said seeds are maize seeds. 

Througout this specification the terms "seed" and "seeds" may be interchanged 
10 with the terms "grain" or "grains". In particular, the terms "seed" and "seeds" refers to 
edible seeds or seed parts, in particular, seed endosperm. 

The present invention still further comprises a plant which comprises a seed 
according to the preceding paragraph. 

The present invention further provides a plant or plant material which comprises a 
15 polynucleotide or a polynucleotide sequence or a vector as described above. In a 

particular embodiment said plant or plant material is a rice plant or rice plant material or 
a maize plant or maize plant material. In a still further embodiment the plant or plant 
material of the present invention is selected from the group consisting watermelon, 
melon, mango, soybean, cotton, tobacco, sugar beet, oilseed rape, canola, flax, sunflower, 
20 potato, tomato, alfalfa, lettuce, maize, wheat, sorghum, rye, bananas, barley, oat, turf 
grass, forage grass, sugar cane, pepper, pea, field bean, rice, pine, poplar, apple, peach, 
grape, strawberry, carrot, cabbage, onion, citrus, cereal or nut plants or any other 
horticultural crops. In a specific embodiment said plant is a rice plant. 

The present invention still further provides a plant or seed according to the 
25 invention which further comprises a gene which gene encodes an enzyme which is 
capable of converting carotene to a retinoid. An example of such a gene is the gene 
encoding p-carotene dioxygenase as described in WO01/48162 and/or WO01/48163. 

The present invention still further provides a plant according to the invention 
which further comprises a polynucleotide which provides for a trait selected from the 
30 group consisting of: insect resistance and/or tolerance; nematode resistance and/or 

tolerance; herbicide resistance and/or tolerance; improved resistance and/or tolerance to 
stress; a substance having pharmaceutical activity and/or any other desired agronomic 
trait. 
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The present invention still further provides a plant according to the invention 
which further comprises a polynucleotide which provides for a further enhancement of 
isoprenoid biosynthesis and/or carotenoid accumulation in the plant. In a particular 
embodiment said polynucleotide provides for a transcription factor which provides for a 
further enhancement of isoprenoid biosynthesis and/or carotenoid accumulation in the 
plant. 

The present invention still further provides a plant according to the invention 
which further comprises a polynucleotide which provides for an increase in plastids 
within the plant. In a particular embodiment said polynucleotide comprises the PhyA 
gene from oats or arabidopsis which genes are well known to th person skilled in the art. 
In a further embodiment said polynucleotide comprises the Hpl or Hp2 gene from 
tomato which are also well known the the person skilled in the art. 

The present invention still further provides a molecular marker which marker is 
capable of identifying plant material which comprises a sequence selected from the' group 
depitected as SEQ ID NOS: 1 to 9 wherein said molecular marker comprises at least 
about 25 contigous nucleotides of a sequence selected from the group consisting of SEQ 
ID NOS: 1 to 9. 

The present invention still further provides a method for identifying plant material 
according to the invention via the use, for example, of the polymerase chain reaction 
(PCR). Suitable primers may be designed using parameters well known to those skilled 
in the art and based on the sequences listed in the Sequence Listing. The person skilled 
in the art is well versed in nucleic acid extraction techniques, and once a test sample has 
been isolated, it can be analysed for the presence of the sequence according to the 
invention using techniques that are well known in the art. These include, but are not 
limited to, PCR, RAPIDS, RFLPs and AFLPs. 

The present invention still further provides a kit which kit comprises a means for 
obtaining a test sample and a means for detecting the presence of the sequences of the 
invention within said test sample. Kits may also be generated that are suitable for testing 
the carotenoid content of a test sample and this may optional ly be combined with the 
features of a kit as described in the preceding sentence. 

In a further aspect of the present invention there is provided the use of a 
polynucleotide, polynucleotide sequence or a vector as described above in a method for 
the production of seeds containing increased carotenoids. In a particular embodiment the 
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present invention provides the use of a polynucleotide selected from the group depicted 
as SEQ ID NOS: 1; 2; 3; 4; 5 and 6 for the production of seeds which contain carotenoids 
at levels greater that those of a control like-seed. 

In a further aspect of the present invention there is provided the use of a 
polynucleotide, polynucleotide sequence or a vector as described above in a method for 
the production of seeds containing increased carotenoids. Li a particular embodiment the 
present invention provides the use of a polynucleotide selected from the group depicted 
as SEQ ID NOS: 7; 8 and 9 for the production of seeds which contain carotenoids at 
levels greater that those of a control like-seed. 

In a further aspect of the invention there is provided the use of a polynucleotide, 
polynucleotide sequence or a vector as described above in a method of producing a plant 
which comprises said polynucleotide, said polynucleotide sequence or said vector. 

In a further aspect of the invention there is provided the use of a polynucleotide 
selected from the group depicted as SEQ ID NOS: 1; 2; 3; 4; 5 and 6 in a method for the 
production of a plant comprising said polynucleotide. 

In a further aspect of the invention there is provided the use of a polynucleotide 
selected from the group depicted as SEQ ID NOS: 7; 8 and 9 in a method for the 
production of a plant comprising said polynucleotide. 

In a further aspect of the invention there is provided a method for increasing the 
carotenoid content of seeds comprising inserting into plant material (a) a first 
polynucleotide which comprises as operably linked components (i) a promoter which 
provides for seed preferred expression; and (ii) a nucleotide sequence derived from a 
bacterium which sequence encodes a carotene desaturase; and (iii) a transcription 
termination region; and (b) a second polynucleotide which comprises as operably linked 
components (i) a promoter which provides for seed preferred expression; and (ii) a 
nucleotide sequence encoding a phytoene synthase which sequence is derived from maize 
(Zea sp.) or rice (Orzya sp.)\ and (iii) a transcription termination region; and (c) 
regenerating a seed containing plant from said material and identifying the seeds which 
contain carotenoids at levels greater that those of control like-seeds. 

In a further aspect of the invention there is provided a method for increasing the 
carotenoid content of seeds comprising inserting into plant material (a) a first 
polynucleotide which comprises as operably linked components (i) a promoter which 
provides for seed preferred expression; and (ii) a nucleotide sequence derived from a 
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bacterium which sequence encodes a carotene desaturase; and (iii) a transcription 
termination region; and (b) a second polynucleotide which comprises as operably linked 
components (i) a promoter which provides for seed preferred expression; and (ii) a 
nucleotide sequence encoding a phytoene synthase which sequence is derived from 
5 tomato (Lycopersicon sp.) or pepper (Capsicum sp.); or a bacterium; and (iii) a 

transcription termination region; and (c) regenerating a seed containing plant from said 
material and identifying the seeds which contain carotenoids at levels greater that those 
of control like-seeds. 

In a particular embodiment, step (a) of the preceding paragraph is performed prior 

10 to step (b). In a further embodiment, step (b) of the preceding paragraph is performed 
prior to step (a), hi a still further embodiment, the promoter, the sequence encoding said 
carotene desaturase, the sequence encoding phytoene synthase and the terminator region 
are derived from the sequences depicted in the Sequence Listing. In a still further 
embodiment said carotene desaturase is the CrtI gene from Erwinia sp. In a still further 

15 embodiment the carotene desaturase comprises or consists of the sequence depicted as 
SEQ ID NO: 18 or 19. In a further embodiment said phytoene synthase is derived from 
maize or rice. In a still further embodiment said phytoene synthase is from maize or rice. 
In a still further embodiment said phytoene synthase comprises or consists of a sequence 
selected from the group depicted as SEQ ID NOS: 10; 11; 12; and 13. 

20 In a further aspect of the invention there is provided a method for increasing the 

carotenoid content of seeds comprising crossing (a) a first plant comprising a 
polynucleotide which comprises as operably linked components (i) a promoter which 
provides for seed preferred expression; and (ii) a nucleotide sequence derived from a 
bacterium which sequence encodes a carotene desaturase; and (iii) a transcription 

25 termination region; with (b) a further plant comprising a polynucleotide which comprises 
as operably linked components (i) a promoter which provides for seed preferred 
expression; and (ii) a nucleotide sequence encoding a phytoene synthase which sequence 
is derived from maize (Zea sp.) or rice (Orzya sp.); and (iii) a transcription termination 
region; and (c) harvesting seed from the female parent of the thus crossed plants; and (d) 

30 growing said seed to produce plants comprising further seeds and identifying said further 
seeds which contain carotenoids at levels greater that those of control like-seeds. 

In a further aspect of the invention there is provided a method for increasing the 
carotenoid content of seeds comprising crossing (a) a first plant comprising a 
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polynucleotide which comprises as operably linked components (i) a promoter which 
provides for seed preferred expression; and (ii) a nucleotide sequence derived from a 
bacterium which sequence encodes a carotene desaturase; and (iii) a transcription 
termination region; with (b) a farther plant comprising a polynucleotide which comprises 
5 as operably linked components (i) a promoter which provides for seed preferred 

expression; and (ii) a nucleotide sequence encoding a phytoene synthase which sequence 
is derived from tomato {Lycopersicon sp.) or pepper (Capsicum sp.); or a bacterium; and 
(iii) a transcription termination region; and (c) harvesting seed from the female parent of 
the thus crossed plants; and (d) growing said seed to produce plants comprising further 

10 seeds and identifying said further seeds which contain carotenoids at levels greater that 
those of control like-seeds. 

In a still further embodiment, the promoter, the sequence encoding said carotene 
desaturase, the sequence encoding phytoene synthase and the terminator region are 
obtainable from the sequences depicted in the Sequence Listing. In a still further 

15 embodiment sai4 carotene desaturase is the CrtI gene from Erwinia sp. In a still further 
embodiment the carotene desaturase comprises or consists of the sequence depicted as 
SEQ ID NO: 18 or 19. In a further embodiment said phytoene synthase is derived from 
maize or rice. In a still further embodiment said phytoene synthase is from maize or rice. 
In a still further embodiment said phytoene synthase comprises or consists of a sequence 

20 selected from the group depicted as SEQ ID NOS: 10; 1 1 ; 12 and 13. 

In a further aspect of the invention there is provided a polynucleotide which 
comprises (a) a region which comprises as operably linked components (i) a promoter 
which provides for seed preferred expression; and (ii) a nucleotide sequence derived 
from a bacterium which sequence encodes a carotene desaturase or a nucleotide sequence 

25 which encodes a carotene desaturase derived from a plant selected from the group 

consisting of: tomato (Lycopersicon sp.); pepper (Capsicum sp.)] maize (Zea sp.); rice 
(Orzya sp.); and (iii) a transcription termination region; and (b) a further region which 
comprises as operably linked components (i) a promoter which provides for seed 
preferred expression; and (ii) a nucleotide sequence encoding a phytoene synthase which 

30 sequence is derived from a bacterium, or from a plant selected from the group consisting 
of: tomato (Lycopersicon sp.); pepper (Capsicum sp.); maize (Zea sp.); rice (Orzya sp.); 
and (iii) a transcription termination region; and (c) a still further region which comprises 
as operably linked components (i) a promoter which provides for seed preferred 
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expression; and (ii) a nucleotide sequence encoding a zeta-carotene desaturase (ZDS) 
derived from a bacterium, or from a plant selected from the group consisting of: tomato 
(Lycopersicon sp.); pepper (Capsicum sp.); maize (Zea sp.); rice (Orzyta sp.); and (iii) a 
transcription termination region. In a particular embodiment the carotene desaturase and 
5 phytoene synthase is derived from maize (Zea sp.) and and zeta-carotene desaturase 
(ZDS) is derived from pepper (Capsicum sp.). In a further embodiment the carotene 
desaturase and phytoene synthase is from maize (Zea sp.) and the zeta-carotene 
desaturase (ZDS) is from pepper (Capsicum sp.). 

• The polynucleotides as described above may be used to identify polynucleotide 

10 sequences providing for a like-function, based on the hybrisation conditions described 
above. These polynucleotides and polynucleotide sequences which comprise the zeta- 
carotene desaturase may also be used in methods for increasing the carotenoid content of 
seeds in a manner analogous to the methods described above. 

In a further aspect of the invention there is provided a polynucleotide which 

15 comprises as operably linked components (i) a promoter which provides for seed 

preferred expression; and (ii) a nucleotide sequence derived from a bacterium which 
sequence encodes a carotene desaturase or a nucleotide sequence which encodes a 
carotene desaturase derived from a plant selected from the group consisting of: tomato 
(Lycopersicon sp.); pepper (Capsicum sp.); maize (Zea sp.); rice (Orzya sp.); and (iii) a 

20 nucleotide sequence encoding a phytoene synthase which sequence is derived from a 
plant selected from the group consisting of: tomato (Lycopersicon sp.); pepper 
(Capsicum sp.); maize (Zea sp.); rice (Orzya sp.) or a bacterium; and (iv) a nucleotide 
sequence encoding a zeta-carotene desaturase (ZDS) derived from a bacterium or from a 
plant selected from the group consisting of: tomato (Lycopersicon sp.) ; pepper 

25 (Capsicum sp.); maize (Zea sp.); rice (Orzya sp.); and (v) a transcription termination 
region. 

Any of the regions described in this specification may be separated by a region 
which provides for a self-processing polypeptide which is capable of separating the 
proteins such as the self-processing polypeptide described in US5,846,767 or any 
30 similarly functioning element. Alternatively the regions may be separated by a sequence 
which acts as a target site for an external element which is capable of separating the 
protein sequences. Alternatively the polynucleotide may provide for a polyprotein which 
comprises a plurality of protein functions. In a further embodiment of the present 
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invention the proteins of the polyprotein may be arranged in tandem. The person skilled 
in the art will appreciate that when a polynucleotide is generated which encodes such a 
polyprotein, expression of such a polyprotein may be achieved via the use of a single 
promoter which promoter is described herein. 

All of the polynucleotides and polynucleotide sequences described throughout 
this specification can be isolated and constructed using techniques that are well known to 
the person skilled in the art. For example, the polynucleotides can be synthesised using 
standard polynucleotide synthesisers. Such synthetic polynucleotides can be synthesised 
and then ligated to form the longer polynucleotides according to the invention. The 
polynucleotides and polynucleotide sequences can also be isolated from other 
constructs/vectors which contain said sequences and then be inserted into further 
constructs/vectors to produce the ones according to the invention. The sequences can 
also be isolated from libraries, for example cDNA and gDNA, using the sequence 
information provided in the Sequence Listing for the creation of suitable probes/primers 
for the purpose of identifying said sequences from said libraries. Once isolated, the 
sequences can be assembled to create the polynucleotides and polynucleotide sequences 
according to the invention. The sequences according to the invention can also be used to 
identify like-sequences in accordance with the hybridisation conditions described above. 
In identifying such like-sequences the person skilled in the art may wish to identify like- 
sequences of one or more of the component parts of the sequences depicted in the 
Sequence Listing and then subsequently assemble the like-sequences in a manner similar 
to the arrangement of the sequences depicted in the Sequence Listing. For example, it 
may be desired to modify the region encoding phytoene synthase gene only. Once the 
modified phytoene synthase encoding sequence had been identified, it could be used to 
replace the phytoene synthase encoding sequence in one of the sequences depicted in the 
Sequence Listing. Alternatively, the modified phytoene synthase may be used in the 
creation of a new sequence wherein all the component parts are the same as a sequence in 
the Sequence Listing save for the phytoene synthase. In a further example, all of the 
components may be modified and then each of the modified components is arranged in 
the same manner as the sequences of the Sequence Listing, for example, promoter-intron- 
target sequence-carotene desaturase-tenninator- promoter-intron-(target sequence) - 
phytoene synthase-terminator. The degree of modification will affect the ability of the 
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tag 




<210> 


14 


<211> 


420 


<212> 


PRT 


<213> 


Oryza sp 


<400> 


14 



1263 



Met Ala Ala lie Thr Leu Leu Arg Ser Ala Ser Leu Pro Gly Leu Ser 
1 5 .10 15 



Asp Ala Leu Ala Arg Asp Ala Ala Ala Val Gin His Val Cys Ser Ser 
20 25 30 



Tyr Leu Pro Asn Asn Lys Glu Lys Lys Arg Arg Trp lie Leu Cys Ser 
35 40 45 



Leu Lys Tyr Ala Cys Leu Gly Val Asp Pro Ala Pro Gly Glu lie Ala 
50 55 60 



Arg Thr Ser Pro Val Tyr Ser Ser Leu Thr Val Thr Pro Ala Gly Glu 
65 70 75 80 

Ala Val lie Ser Ser Glu Gin Lys Val Tyr Asp Val Val Leu Lys Gin 
85 90 55 



Ala Ala Leu Leu Lys Arg His Leu Arg Pro Gin Pro His Thr He Pro 
100 105 HO 



He Val Pro Lys Asp Leu Asp Leu Pro Arg Asn Gly Leu Lys Gin Ala 
115 120 125 



Tyr His Arg Cys Gly Glu lie Cys Glu Glu Tyr Ala Lys Thr Phe Tyr 
130 135 140 



Leu Gly Thr Met Leu Met Thr Glu Asp Arg Arg Arg Ala He Trp Ala 
145 150 155 160 



He Tyr Val Trp Cys Arg Arg Thr Asp Glu Leu Val Asp Gly Pro Asn 
165 170 175 
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Ala Ser His lie Thr Pro Ser Ala Leu Asp Arg Trp Glu Lys Arg Leu 

180 185 190 



Asp Asp Leu Phe Thr Gly Arg Pro Tyr Asp Met Leu Asp Ala Ala Leu 
195 200 205 



Ser Asp Thr lie Ser Lys Phe Pro He Asp He Gin Pro Phe Arg Asp 
210 215 220 



Met He Glu Gly Met Arg Ser Asp Leu Arg Lys Thr Arg Tyr Lys Asn 
225 230 235 * 240 



Phe Asp Glu Leu Tyr Met Tyr Cys Tyr Tyr Val Ala Gly Thr Val Gly 
245 250 255 



Leu Met Ser Val Pro Val Met Gly He Ala Pro Glu Ser Lys Ala Thr 
260 265 270 



Thr Glu Ser Val Tyr Ser Ala Ala Leu Ala Leu Gly He Ala Asn Gin 
275 280 285 



Leu Thr Asn He Leu Arg Asp Val Gly Glu Asp Ala Arg Arg Gly Arg 
290 295 * 300 



He Tyr Leu Pro Gin Asp Glu Leu Ala Glu Ala Gly Leu Ser Asp Glu 
305 310 315 320 



Asp He Phe Asn Gly Val Val Thr Asn Lys Trp Arg Ser Phe Met Lys 
325 330 335 



Arg Gin He Lys Arg Ala Arg Met Phe Phe Glu Glu Ala Glu Arg Gly 
340 345 350 



Val Thr Glu Leu Ser Gin Ala Ser Arg Trp Pro Val Trp Ala Ser Leu 
355 360 365 



Leu Leu Tyr Arg Gin He Leu Asp Glu He Glu Ala Asn Asp Tyr Asn 
370 375 380 



Asn Phe Thr Lys Arg Ala Tyr Val Gly Lys Ala Lys Lys Leu Leu Ala 
385 390 395 * 400 



Leu Pro Val Ala Tyr Gly Arg Ser Leu Leu Met Pro Tyr Ser Leu Arg 
405 410 415 
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Asn Ser Gin Lys 
420 

<210> 15 
<211> 1260 
<212> DWA 

<213> Capsicum annuum 



<400> 15 

atgtctgttg ccttgttatg 


ggttgtttct 


ccttgtgacg 


tctcaaacgg 


gacaggattc 


60 


^^oatafccco ttccftQaciQCi 

^^33 3 ^3 3 "3-3 3 


aaaccggatt 


tttgattcgt 


cggggcgtag 


gaatttggcg 


120 


tgcaatgaga gaatcaagag 


aggaggtgga 


aaacaaaggt 


ggagttttgg 


ttcttacttg 


180 


oaaaaaacac aaactQaaag 


tggacggaaa 


ttttctgtac 


gttctgctat 


cgtggctact 


240 


ccacjctQcjaa aaatgacgat 


gtcatcagaa 


cggatggtat 


atgatgtggt 


tttgaggcag 


300 


gcagccttgg tgaagagaca 


gctgagatcg 


accgatgagt 


tagatgtgaa 


gaaggatata 


- 3 60. 


cctattccgg ggactttggg 


cttgttgagt 


gaagcatatg 


ataggtgtag 


tgaagtatgt 


420 


gcagagtacg caaagacgtt 


ttacttagga 


acgatgctaa 


tgactccgga 


gagaagaaag 


480 


ar*t-a f*c?t"daa caatfataccjfc 


atggtgcagg 


agaacagacg aacttgttga 


tggtccgaat 


540 


gcatcacaca ttactccggc 


ggccttagat 


aggtgggaag 


acaggctaga 


agatgttttc 


600 


agtggacggc catttgacat 


gctcgatgct 


gctttgtccg 


acacagtttc 


caaatttcca 


660 


gttgatattc agccattcag 


agatatgatt 


gaaggaatgc 


gtatggactt 


gaggaagtca 


720 


agatacagaa actttgacga 


actataccta 


tattgttatt 


acgttgctgg 


tacggttggg 


780 


ttgatgagtg ttccaattat 


gggcatcgca 


cctgaatcaa 


aggcaacaac 


ggagagcgta 


840 


tataatgctg ctttggcttt 


ggggatcgca 


aatcagctga 


ccaacatact 


tagagatgtt 


900 


ggagaagatg ccagaagagg 


aagagtctat 


ttgcctcaag atgaattagc 


acaggcaggt 


960 


ctatccgacg aagacatatt 


tgctggaaga 


gtgaccgata 


aatggagaat 


cttcatgaag 


1020 


aaacaaattc agagggcaag 


aaagttcttt 


gacgaggcag 


agaaaggagt 


gaccgaattg 


1080 


agcgcagcta gtagatggcc 


tgtgttggca 


tctctgctgt 


tgtaccgcag 


gatactggac 


1140 


gagatcgaag ccaatgacta 


caacaacttc 


acaaagagag 


cttatgtgag 


caaaccaaag 


1200 


aagttgattg cattacctat 


tgcatatgca 


aaatctcttg 


tgccttctac 


aagaacatga 


1260 



<210> 16 
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<211> 1239 
<212> DNA 

<213> Lycopersicon esculentum 
<400> 16 

atgtctgttg ccttgttatg ggttgtttct ccttgtgacg tctcaaatgg gacaagtttc 60 

atggaatcag tccgggaggg aaaccgtttt tttgattcat cgaggcatag gaatttggtg 120 

tccaatgaga gaatcaatag aggtggtgga aagcaaacta ataatggacg gaaattttct 180 

gtacggtctg ctattttggc tactccatct ggagaacgga cgatgacatc ggaacagatg 240 

gtctatgatg tggttttgag gcaggcagcc ttggtgaaga ggcaactgag atctaccaat 300 

gagttagaag tgaagccgga tatacctatt ccggggaatt tgggcttgtt gagtgaagca 3 60 

tatgataggt gtggtgaagt atgtgcagag tatgcaaaga cgtttaactt aggaactatg 420 

ctaatgactc ccgagagaag aagggctatc tgggcaatat atgtatggtg cagaagaaca 480 

gatgaacttg ttgatggccc aaacgcatca tatattaccc cggcagcctt agataggtgg 540 

gaaaataggc tagaagatgt tttcaatggg cggccatttg acatgctcga tggtgctttg 600 

tccgatacag .bttctaactt tccagttgat attcagccat tcagagatat gattgaagga 660 

atgcgtatgg acttgagaaa atcgagatac aaaaacttcg acgaactata cctttattgt 720 

tattatgttg ctggtacggt tgggttgatg agtgttccaa ttatgggtat cgcccctgaa 780 

tcaaaggcaa caacagagag cgtatataat gctgctttgg ctctggggat cgcaaatcaa 840 

ttaactaaca tactcagaga tgttggagaa gatgccagaa gaggaagagt ctacttgcct 900 

caagatgaat tagcacaggc aggtctatcc gatgaagata tatttgctgg aagggtgacc 960 

gataaatgga gaatctttat gaagaaacaa atacataggg caagaaagtt ctttgatgag 1020 

gcagagaaag gcgtgacaga attgagctca gctagtagat tccctgtatg ggcatctttg 1080 

gtcttgtacc gcaaaatact agatgagatt gaagccaatg "actacaacaa cttcacaaag 1140 

agagcatatg tgagcaaatc aaagaagttg attgcattac ctattgcata tgcaaaatct 1200 

ctfcgtgcctc ctacaaaaac tgcctctctt caaagataa 1239 

<210> 17 

<211> 891 

<212> DWA 

<213> Erwinia sp. 
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<400> 17 

atggcagttg gctcgaaaag 


ttttgcgaca 


gectcaaagt 


tatttgatgc 


aaaaaccegg 


60 


cgcagcgtac 


tgatgctcta 


cgcctggtgc 


cgccattgtg 


acgatgttat 


tgacgatcag 


120 


acgctgggct ttcaggcccg 


gcagcctgcc 


ttacaaaege 


ccgaacaacg 


tetgatgeaa 


180 


cttgagatga 


aaacgcgcca 


ggcct atgca 


qqatcqcaqa 

ZJ ZJ ZJ —> 


tgcacgaacc 


ggcgtttgcg 


240 


gcttttcagg 


aagtggctat 


ggct ca tgat 


afccgccccgg 


ettacgegtt 


tgatcatctg 


300 


gaaggcttcg 


cgatggatgt 




caatacaocc 


aactggatga 


tacgctgcgc 


360 


tattgctatc 


acgttgcagg 






cgcaaatcat 


qqqcqtqcqq 

Z) ZJ ZJ ZJ ZJ Z2 Z3 


420 


gataacgcca 


cgctggaccg 


cgee fcgfcgac 




pat t- fcaatt 


gaccaatatt 


480 


gctcgcgata 


ttgtggacga 


cgcgcatgcy 




atctaccaac 


aaqctqqctq 


540 


gagcatgaag gtctgaacaa 


agagaattat 


gcggcacctg 


aaaacegtea 


ggegctgage 


600 


cgtatcgccc 


gacgtttggt 


gcaggaagca 


gaaccttact 


atttgtctgc 


cacagccggc 


660 


ctggcagggt 


tgcccctgcg 


ttccgcctgg 


geaategcta 


eggegaagea 


ggtttaccgg 


720 


aaaataggtg 


tcaaagttga 


acaggceggt 


cagcaagcct 


gggatcagcg 


gcagtcaacg 


780 


a.ccacgcccg 


aaaaattaac 


getgetgetg 


gccgcctctg 


gtcaggccct 


tacttcccgg 


840 


atgcgggctc 


atcctccccg 




cfcctqacaqc 


gcccgctcta 


g 


891 


<210> 18 














<211> 1479 






* 




* 


<212> DNA 














I 

<213> Erwinia sp . 












<400> 18 
atgaaaccaa 


ctacggtaat 


tqqtacaqqc 


ttcggtggcc 


tggcactggc 


aattegtcta 


60 


caagctgcgg 


ggatccccgt 


cttac tget t 


gaacaacgtg 


ataaaccegg 


eggteggget 


120 


tatgtctacg 


aggatcaggg 


gtttaccttt 


gatgeaggee 


eg acyy uuaL 


v_ d ^ — y a. X— k*, w 


180 


agtgccattg 


aagaactgtt 


tgcactggca 


ggaaaacagt 


taaaagagta 


tgtcgaactg 


240 


ctgccggtta 


cgccgtttta 


ccgcctgtgt 


tgggagtcag 


ggaaggtctt 


taattacgat 


300 


aacgatcaaa 


cccggctcga 


agegcagatt 


cagcagttta 


atccccgcga 


tgtcgaaggt 


360 


tatcgtcagt 


ttctggacta 


ttcacgcgcg 


gtgtttaaag 


aaggctatct 


gaagctcggt 


420 


actgtccctt 


ttttatcgtt 


cagagacatg 


cttcgcgccg 


cacctcaact 


ggcgaaactg 


480 
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caggcatgga gaagcgttta cagtaaggtt gccagttaca tcgaagatga acatctgcgc 

540 

caggcgtttt ctttccactc gctgttggtg ggcggcaatc ccttcgccac ctcatccatt . 600 

tatacgttga tacacgcgct ggagcgtgag tggggcgtct ggtttccgcg tggcggcacc 660 

ggcgcattag ttcaggggat gataaagctg tttcaggatc tgggtggcga agtcgtgtta 720 

aacgccagag tcagccatat ggaaacgaca ggaaacaaga ttgaagccgt gcatttagag 780 

gacggtcgca ggttcctgac gcaagccgtq gcgtcaaatg cagatgtggt tcatacctat 84 0 

cgcgacctgt taagccagca ccctgccgcg gttaagcagt ccaacaaact gcagactaag 900 

cgcatgagta actctctgtt tgtgctctat tttggtttga atcaccatca tgatcagctc 960 

gcgcatcaca cggtttgttt cggcccgcgt taccgcgagc tgattgacga aatttttaat 1020 

catgatggcc tcgcagagga cttctcactt tatctgcacg cgccctgtgt cacggattcg 10 80 

tcactggcgc ctgaaggttg cggcagttac tatgtgttgg cgccggtgcc gcatttaggc 1140 : 

accgcgaacc tcgactggac ggttgagggg ccaaaactac gcgaccgtat ttttgcgtac 12 00 

cttgagcagc attacatgcc tggcttacgg agtcagctgg tcacgcaccg gatgtttacg 1260 

ccgtttgatt ttcgcgacca gcttaatgcc tatcatggct cagccttttc tgtggagccc 1320 

gttcttaccc agagcgcctg gtttcggccg cataaccgcg ataaaaccat tactaatctc 13 80 

tacctggtcg gcgcaggcac gcatcccggc gcaggcattc ctggcgtcat cggctcggca 1440 

aaagcgacag caggtttgat gctggaggat ctgatttga 14 79 

<210> 19 
<211> 1488 
<212> DNA 
<213> Erwinia sp. 

<400> 19 

atggcggccg ccaaaccaac tacggtaatt ggtgcaggct tcggtggcct ggcactggca 60 

attcgtctac aagctgcggg gatccccgtc ttactgcttg aacaacgtga taaacccggc 120 

ggtcgggctt atgtctacga ggatcagggg tttacctttg atgcaggccc gacggttatc 180 

accgatccca gtgccattga agaactgttt gcactggcag gaaaacagtt aaaagagtat 240 

gtcgaactgc tgccggttac gccgttttac cgcctgtgtt gggagtcagg gaaggtcttt 3 00 

aattacgata acgatcaaac ccggctcgaa gcgcagattc agcagtttaa tccccgcgat 3 60 

gtcgaaggtt atcgtcagtt tctggactat tcacgcgcgg tgtttaaaga aggctatctg 420 
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aagctcggta ctgtcccttt tttatcgttc agagacatgc ttcgcgccgc acctcaactg 480 

gcgaaactgc aggcatggag aagcgtttac agtaaggttg ccagttacat cgaagatgaa 540 

catctgcgcc aggcgttttc tttccactcg ctgttggtgg gcggcaatcc cttcgccacc 6O0 

tcatccattt atacgttgat acacgcgctg gagcgtgagt ggggcgtctg gtttccgcgt 660 

ggcggcaccg gcgcattagt tcaggggatg ataaagctgt ttcaggatct gggtggcgaa 72 0 

gtcgtgttaa acgccagagt cagccatatg gaaacgacag gaaacaagat tgaagccgtg 780 

catttagagg acggtcgcag gttcctgacg caagccgtcg cgtcaaatgc agatgtggtt 84 0 

catacctatc gcgacctgtt aagccagcac cctgccgcgg ttaagcagtc caacaaactg 900 

cagactaagc gcatgagtaa ctctctgttt gtgctctatt ttggtttgaa tcaccatcat 960 

gatcagctcg cgcatcacac ggtttgtttc ggcccgcgtt accgcgagct gattgacgaa 1020 

atttttaatc atgatggcct cgcagaggac ttctcacttt atctgcacgc gccctgtgtc 1080 

acggattcgt cactggcgcc tgaaggttgc ggcagttact atgtgttggc gccggtgccg 1140 

catttaggca ccgcgaacct cgactggacg gttgaggggc caaaactacg cgaccgtatt - 1200 

tttgcgtacc ttgagcagca ttacatgcct ggcttacgga gtcagctggt cacgcaccgg 1260 

atgtttacgc cgtttgattt tcgcgaccag cttaatgcct atcatggctc agccttttct 132 0 

gtggagcccg ttcttaccca gagcgcctgg tttcggccgc ataaccgcga taaaaccatt 13 8 0 

actaatctct acctggtcgg cgcaggcacg catcccggcg caggcattcc tggcgtcatc 144 0 

ggctcggcaa aagcgacagc aggtttgatg ctggaggatc tgatttga 14 8 8 

<210> 20 
<211> 839 
<212> DNA 
<213> Oryza sp . 



<400> 20 

gttaatcatg gtgtaggcaa cccaaataaa acaccaaaat atgcacaagg cagtttgttg 60 

tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt agaaaaggaa 120 

acaatatcat gagtaatgtg tgagcattat gggaccacga aataaaaaga acattttgat 180 

gagtcgtgta tcctcgatga gcctcaaaag ttctctcacc ccggataaga aacccttaag 240 

caatgtgcaa agtttgcatt ctccactgac ataatgcaaa ataagatatc atcgatgaca 300 

tagcaactca tgcatcatat catgcctctc tcaacctatt cattcctact catctacata 360 
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agtatcttca gctaaatgtt agaacataaa cccataagtc acgtttgatg agtattaggc 420 

gtgacacatg acaaatcaca gactcaagca agataaagca aaatgatgtg tacataaaac 48 0 

tccagagcta tatgtcatat tgcaaaaaga ggagagctta taagacaagg catgactcac 54 0 

aaaaattcat ttgcctttcg tgtcaaaaag aggagggctt tacattatcc atgtcatatt 600 

gcaaaagaaa gagagaaaga acaacacaat gctgcgtcaa ttatacatat ctgtatgtcc 660 

atcattattc atccaccttt cgtgtaccac acttcatata tcatgagtca cttcatgtct 720 

ggacattaac aaactctatc ttaacattta gatgcaagag cctttatctc actataaatg 780 

cacgatgatt tctcattgtt tctcacaaaa agcattcagt tcattagtcc tacaacaac 83 9 

<210> 21 

<211> 642 

<212> DNA 

<213> Oryza sp. 



<400> 21 

aagcttgcgc gcggaatacg gtggagatgg gttgggaacc ctggattcca aacacagccc 60 

aagtctatcc aaaatgttta gacaagaaaa tacgtaacaa gttggtttac agaaatacga 120 

attagatcaa tcctgcacta caagtagagt aaagtggtga tttctcttaa atctctcgaa 180 

tggtgattta agaattcagt gcaaaccaaa tccttgctat aatcaaatgt: tcggbaccgc 240 

atcaacggaa caataaaaag cgcctggcgt accataattt tgtcattctt gttgaaattt 300 

gtaatttaag atgcatgagg ccacacgacc ttaatgttca acgtgtcatg cattagtgaa 3 60 

ataatagctc acaaaacgca acaaatatag ctagataacg gttgcaatcc ttaccaaact 420 

aacgtataaa gtgagcgatg aghcatatca ttatctcccg cctgctaacc atcgtgtaca 480 

ccatccgatc acaaaaatga caacttctag ggatgaacct ggacaaggtt tagggtttag 540 

ggatgaatct ggacaaatga ttgttcaggt tcatccctag atgttggttt ctccfcgacgg 600 

gacggaggga gtatatgtga tggacacaaa agttactttc at 642 

<210> 22 
<211> 190 
<212> DNA 

<213> SYNTHETIC - 1NTR0N 
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<400> 22 

gtaaatttct agtttttctc cttcattttc ttggttagga cccttttctc t ttttatttt 60 
tttgagcttt gatctttctt taaactgatc tattt tttaa ttgattggtt atcgtgtaaa 120 
tattacatag ctttaactga taatctgatt actttatttc gtgtgtcttt gatcatcttg 180 
atagttacag 

<210> 23 

<211> 171 

<212> DNA 

<213> TRANSIT PEPTIDE 

<400> 23 



atggcttcta 


tgatatcctc 


ttccgctgtg 


acaacagtca 


gccgtgcctc 


tagggggcaa 


60 


tccgccgcag 


tggctccatt 


cggcggcctc 


aaatccatga 


ctggattccc 


agtgaagaag ' 


120 


gtcaacactg 


acattacttc 


cattacaagc 


aatggtggaa 


gagtaaagtg 


c 


171 


<210> 24 














<211> 254 














<212> DNA 














<213> NOS 


TERMINATOR 












<400> 24 
gatcgttcaa 


acatttggca 


ataaagtttc 


ttaagattga 


atcctgttgc 


eggtcttgeg 


60 


atgattatca 


tataatttct 


gttgaattac 


gttaagcatg 


taataattaa 


catgtaatgc 


120 


atgacgttat 


ttatgagatg 


ggtttttatg 


attagagtcc 


cgcaattata 


catttaatac 


180 


gcgatagaaa 


acaaaatata 


gcgcgcaaac 


taggataaat 


tatcgcgcgc 


ggtgtcatct 


240 


atgttactag 


at eg 










254 



<210> 25 

<211> 193 

<212> DNA 

<213> CAMV 35S TERMINATOR REGION 
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<400> 25 

gctgaaatca ccagtctctc tctacaaatc tatcfcctctc tataataatg tgtgagtagt 60 
tcccagataa gggaattagg gttcttatag ggtttcgctc atgtgttgag catataagaa 120 
acccttagta tgtatttgta tttgtaaaat acttctatca ataaaatttc taattcctaa 180 
aaccaaaatc cag 

<210> 26 

<211> 238 

<212> DNA 

<213> POTATO PROTEINASE INHIBITOR GENE TERMINATOR REGION 



<400> 26 

ccctagactt gtccatcttc tggattggcc aacttaatta atgtatgaaa taaaaggatg 60 

cacacatagt gacatgctaa tcactataat gtgggcatca aagttgtgtg ttatgtgtaa 120 

ttactaatta tctgaataag agaaagagat catccatatt tcttatccta aatgaatgtc 180 

acgtgtcttt ataattcttt gatgaaccag atgcatttta ttaaccaatt ccatatac 23 8 

<2lO> 27 

<211> 2321 

<212> DNA 

<213> Lycopersicon esculentum 



<400> 27 

gggtttatct cgcaagtgtg gctatggtgg gacgtgtcaa attttggatt gtagccaaac 60 

atgagatttg atttaaaggg aattggceaa atcaccgaaa gcaggcatct tcatcataaa 120 

ttagtttgtt tatttataca gaattatacg cttttactag ttatagcatt cggtatcttt 180 

ttctgggtaa ctgccaaacc accacaaatt tcaagtttcc atttaactct tcaacttcaa 240 

cccaaccaaa tttatttgct taattgtgca gaaccactcc ctatatcttc taggtgcttt 300 

cattcgttcc gagtaaaatg cctcaaattg gacttgtttc tgctgttaac ttgagagtcc 360 

aaggtagttc agcttatctt tggagctcga ggtcgtcttc tttgggaact gaaagtcgag 420 

atggttgctt gcaaaggaat tcgttatgtt ttgctggtag cgaatcaatg ggtcataagt 480 

taaagattcg tacfcccccat gccacgacca gaagattggt taaggacttg gggcctttaa 540 
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aggtcgtatg cattgattat ccaagaccag agctggacaa tacagttaac tatttggagg 

600 

ctgcattttt atcatcaacg ttccgtgctt ctccgcgccc aactaaacca ttggagattg 660 

ttattgctgg tgcaggtttg ggtggtttgt ctacagcaaa atatttggca gatgctggtc 720 

acaaaccgat actgctggag geaagggatg ttctaggtgg aaaggtagct gcatggaaag 780 

atgatgatgg agattggtac gagactggtt tgcatatatt ctttggggct tacccaaata 840 



ttcagaacct 


gtttggagaa 


ttagggatta 


acgatcgatt 


gcaatggaag 


gaacattcaa 


900 


tgatatttgc 


aatgccaagc 


aagccaggag 


aattcagccg 


ctttgatttc 


tccgaagctt 


960 


tacccgctcc 


tttaaatgga 


attttagcca 


tcttaaagaa 


taacgaaatg 


cttacatggc 


1020 


cagagaaagt 


caaatttgca 


attggactct 


tgccagcaat 


gcttggaggg 


caatcttatg 


1080 


ttgaagctca 


agatgggata 


agtgttaagg 


actggatgag 


aaagcaaggt 


gtgccggaca 


1140 


gggtgacaga 


tgaggtgttc 


attgctatgt 


caaaggcact 


caactttata 


aaccctgacg 


1200 


aactttcaat 


gcagtgcatt 


ttgatcgcat 


tgaacaggtt 


tcttcaggag 


aaacatggtt 


1260 


caaaaatggc 


ctttttagat 


ggtaatcctc 


ctgagagact 


ttgcatgccg 


attgttgaac 


1320 


acattgagtc 


aaaaggtggc 


caagtcagac 


tgaactcacg 


aataaaaaag 


attgagctga 


1380 


atgaggatgg 


aagtgtcaag 


agttttatac 


tgagtgacgg 


tagtgcaatc 


gagggagatg 


1440 


cttttgtgtt 


tgccgctcca 


gtggatattt 


tcaagcttct 


attgcctgaa 


gactggaaag 


1500 


agattccata 


tttccaaaag 


ttggagaagt 


tagtcggagt 


acctgtgata 


aatgtacata 


1560 


tatggfcttga 


cagaaaactg 


aagaacacat 


atgatcattt 


gctcttcagc 


agaagctcac 


1620 


tgctcagtgt 


gtatgctgac 


atgtctgtta 


catgtaagga 


atattacaac 


cccaatcagt 


1680 


cfcatgttgga 


attggttttt 


gcacctgcag 


aacxacrtQciat 


atct cgcagc 


gactcagaaa 


1740 


ttattgatgc 


aacgatgaag 


gaactagcaa 


cgctttttcc 


tgatgaaatt 


tcagcagatc 


1800 


aaagcaaagc 


aaaaatattg 


aagtaccatg 


ttgtcaaaac 


tccgaggtct 


gtttataaaa 


1860 


ctgtgccagg 


ttgtgaaccc tgtcggcctt 


tacaaagatc 


cccaatagag 


gggttttatt 


1920 


tagccggtga 


ctacacgaaa 


cagaaatact 


tggcttcaat 


ggaaggcgct 


gtcttatcag 


1980 


gaaagctttg 


tgctcaagct 


attgtacagg 


attatgagtt 


acttgttgga 


cgtagccaaa 


2040 


agaagttgtc 


ggaagcaagc 


gtagtttagc 


tttgtggtta 


ttatttagct 


tctgtacact 


2100 


aaat ttatga 


tgcaagaagc 


gttgtacaca 


acatatagaa 


gaagagtgcg 


aggtgaagca 


2160 


agtaggagaa 


atgttaggaa 


agctcctata 


caaaaggatg 


gcatgttgaa 


gattagcatc 


2220 


tttttaatcc 


caagtttaaa 


tataaagcat 


attttatgta 


ccactttctt 


tatctggggt 


2280 


ttgtaatccc 


tttatatctt 


tatgcaatct 


ttacgttagt 


t 




2321 
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<210> 28 
<211> 1749 
<212> DMA 

<213> Capsicum anrmum 



<400> 28 

atgccccaaa ttggacttgt ttctgctgtc aacttgagag tccaaggtaa ttcagcttat 60 

ctttggagct cgaggtcttc tttgggaact gatagtcaag atggttgctc gcaaaggaat 12 0 

tcgttatgtt ttggtggtag tgactcaatg agtcataggt taaagattcg taatccccat 180 

tccataacga gaagattggc taaggatttc cggcctttaa aggttgtttg cattgattat 240 

ccaaggccag agctagacaa tacagttaac tatttggagg ctgcattctt atcatcatca 300 

ttccgatctt ctccgcgccc aaccaaacca ctggagattg ttattgctgg tgcaggtttg 360 

Sgtggtttgt ctacagcaaa atatttggca gatgctggtc acaaaccaat actgctggag 420 

gcaagggatg ttctaggtgg aaaggtagct gcatggaaag atgatgatgg agattggtat 480 

gagactggtt tgcacatatt ctttggggct tacccaaata tgcagaacct atttggagaa 540 

ttagggataa atgatcgatt gcaatggaag gaacattcga tgatatttgc aatgccaaac 600 

aagccaggag aattcagccg ctttgatttc cccgaagctt tacctgctcc tttaaatgga 660 

attttggcaa tcctaaagaa caatgaaatg cttacatggc cagaaaaagt caaatttgca 72 0 

attggactct tgccagcaat gcttggtggg caatcttatg ttgaagctca agacgggata 780 

agtgttaagg actggahgag aaaacaaggt gtgccggata gggtgacgga tgaggtgttc 840 

atcgccatgt caaaggcact taacttcata aatcctgatg agctttcgat gcagtgcatc 900 

ttgatcgcgt tgaacagatt tcttcaggag aaacatggtt caaaaatggc ctttttagat 960 

ggtaatcctc ctgagagact ttgcatgccg attgttgaac atatcgagtc aaaaggtgga 102 0 

caagtcagac tgaactcacg aataaaaaag attgagctga atgaggatgg aagtgtcaag 1080 

tgttttatac tgaacgatgg tagtacaatt gagggagatg cttttgtgtt tgcgactcca 114 0 

gtggatattt tcaagcttct tttgcctgaa gactggaaag agattccata tttccaaaag 1200 

ttggagaagt tagttggagt acctgtgata aatgtccata tatggtttga cagaaaactg 1260 

aagaacacat ctgataattt gctcttcagc agaagcccac tgctcagtgt gtatgctgac 1320 

atgtccgtca catgtaagga atattacgac cccaacaagt ccatgttgga attggtcttt 1380 

gcgcctgcag aagagtgggt atctcgcagt gactctgaaa ttattgatgc tacaatgaag 1440 
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gaactagcaa 


agctatttcc 


tgatgaaatt 


tcggcggatc agagcaaagc 


aaaaatattg 


1500 


aagtatcatg 


ttgtcaaaac 


tccaaggtct 


gtatataaaa ctgtgccagg 


ttgtgaaccc 


1560- 


tgtcggctct 


ugcaaagauc 


ccctgtagag 


gggttttatt tagctggtga ctacacgaaa 


1620 


caga.aat:act 


tggcttcaat 


ggaaggtgct gtcttatcag gaaagctttg tgcacaagct 


1680 


attgtacagg 


attacgagtt 


acttgttggc 


cggagccaga ggaagttggc 


agaaacaagt 


174 0 


gtagtttag 










1749 


<210> 29 












<2X1> 2264 










<212> DNA 












<213> Zea 


mays 










<40O> 29 
ctccaaatgc 


ggaggtctcg 


actcttctct 


cttcctccat ctttatcatc 


gccccacgta 


60 


cacacccaat 


tcctcgcaac 


tgggctcccc 


cgcctccacg acactgcccc 


ccgtctcaag 


120 


tccgccgcct 


ccattcttca 


gctctcctat cctccgccta gaatatcttc atcggtattt 


180 


taccaacctg 


gatcaattta 


ctcacgatac 


tctgaagcgt atacatatgc 


catatgggaa 


240 


atgacttcat 


agctgtgggt 


tgtcttatgg ctccttgaat ttgcagtagt ctgcctgtac 


3 00 


ctattggctg 


aagcagagct 


gacccccact 


ttatcaagag ttgctcaacg 


atggacactg 


3 60 


gctgcctgtc 


atctatgaat 


attactggag 


ctagccagac aagatctttt 


gcggggcaac 


420 


ttcctcctca 


gagatgtttt 


gcgagtagtc 


actatacaag ctttgccgtg 


aaaaaacttg 


480 


tctcaaggaa 


taaaggaagg 


agatcacacc 


gtagacatcc tgccttgcag 


gttgtctgca 


540 


aggattttcc 


aagacctcca 


ctagaaagca 


caataaacta tttggaagct- 


ggacagctct 


600 


cttcattttt 


tagaaacagc 


gaacgcccca 


gtaagccgtt gcaggtcgtg 


gttgctggtg 


660 


caggattggc 


tggtctatca 


acagcgaagt atctggcaga tgctggccat 


aaacccatat 


720 


tgcttgaggc 


aagagatgtt 


ttgggtggaa 


aggtagctgc ttggaaggat 


gaagatggag 


780 


atfcggtacga 


gactgggctt 


catatatttt 


ttggagctta tcccaacata 


cagaatctgt 


84 0 


ttggcgagct 


taggattgag 


gatcgtttgc 


agtggaaaga acactctatg 


atattcgcca 


900 


tgccaaacaa 


gccaggagaa 


ttcagccggt 


tcgatttccc agaaactttg 


ccagcaccta 


960 


taaatgggat 


atgggccata 


ttgagaaaca 


atgaaatgct tacttggccg 


gagaaggtga 


1020 


agtttgcaat 


cggacttctg 


ccagcaatgg 


ttggtggtca accttatgtt 


gaagctcaag 


1080 
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atggcttaac cgtttcagaa tggatgaaaa agcagggtgt tcctgatcgg gtgaacgatg 1140 

aggtttttat tgcaatgtcc aaggcactca atttcataaa tcctgatgag ctatctatgc 1200 

agtgcatttt gattgctttg aaccgatttc ttcaggagaa gcatggttct aaaatggcat 12 60 

tcttggafcgg taatccgcct gaaaggctat gcatgcctat tgttgatcac attcggtcta 132 0 

ggggtggag a ggtccgcctg aattctcgta ttaaaaagat agagctgaat cctgatggaa 13 8 0 

ctgtaaaaca cttcgcactt agtgatggaa ctcaaataac tggagatgct tatgtttgtg 144 0 

caacaccagt cgatatcttc aagcttcttg tacctcaaga gtggagtgaa attacttatt 1500 

tcaagaaact ggagaagttg gtgggagttc ctgttatcaa tgttcatata tggtttgaca 1560 

gaaaactgaa caacacatat gaccaccttc ttttcagcag gagttcactt ttaagtgtct 1620 

atgcagacat gtcagtaacc tgcaaggaat actatgaccc aaaccgttca atgctggagt 1680 

tggtctttgc tcctgcagac gaatggattg gtcgaagtga cactgaaatc atcgatgcaa 174 0 

ctatggaaga gctagccaag ttatttcctg atgaaattgc tgctgatcag agtaaagcaa 1800 

agattcttaa gtatcatatt gtgaagacac cgagatcggt ttacaaaact gtcccaaact 18 60 

gtgagccttg ccggcctctc caaaggtcac ctatcgaagg tttctatcta gctggtgatt 1920 

acacaaagca gaaatacctg gcttctatgg aaggtgcagt cctatccggg aagctttgtg 1980 

cccagtccat agtgcaggat tatagcaggc tcgcactcag gagccagaaa agcctacaat 2040 

caggagaagt tcccgtccca tcttagttgt agttggcttt agctatcgtc atccccactg 2100 

ggtgctatct tatctcctat ttcaatggga acccacccaa tggtcatgtt ggagacaaca 2160 

cctgttatgg tcctttgacc atctcgtggt gactgtagtt gatgtcatat tcggatatat 2220 

atgtaaaagg acctgcatag caattgttag accttggaaa aaaa 2264 

<210> 30 
<211> 2027 
<212> DNA 
<213> Oryza sp. 

<400> 30 

gtttatgaca gcatctgcca gatattttgc aggacaactt cctactcata ggtgcttcgc 60 

aagtagcagc atccaagcac tgaaaggtag tcagcatgtg agctttggag tgaaatctct , 120 

tgtcttaagg aataaaggaa aaagattccg tcggaggctc ggtgctctac aggttgtttg 180 

ccaggacttt ccaagacctc cactagaaaa cacaataaac tttttggaag ctggacaact 240 
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atcctcat tt 


ttcagaaaca 


gtgaacaacc 


cactaaacca 


ttacaggtcg 


tgattgctgg 


300 


agcaggatta gctggtttat 


caacggcaaa 


atatctggca 


gatgctggtc 


ataaacccat 


360 


attgcttgag 


gcaagggatg 


ttttgggtgg 


aaagatagct 


gcttggaagg 


atgaagatgg 


420 


agattggtat 


gaaactgggc 


ttcatatctt 


ttttggagct 


tatcccaaca 


tacagaactt 


480 


gtttggcgag 


cttggtatta 


atgatcggtt 


gcaatggaag 


gaacactcca 


tgatatttgc 


540 


catgccaaac 


aagccaggag 


aatccagccg 


gtttgatttt 


cctgaaacat 


tgcctgcacc 


600 


cttaaatgga 


atatgggcca 


tactaagaaa 


caatgaaatg 


ctaacttggc 


cagagaaggt 


660 


gaagtttgct 


cttggacttt 


tgccagcaat 


ggttggtggc 


caagcttatg 


ttgaagctca 


720 


agatggtttt 


actgtttctg 


agtggatgaa 


aaagcagggt 


gttcctgatc 


gagtgaacga 


780 


tgaagttttc 


attgcaatgt 


caaaggcact 


taatttcata 


a.atcctgatg 


agttatccat 


8.4 0 


gcagtgcatt 


ctgattgctt 


taaaccgatt 


tcttcaggag 


aagcatggtt 


ctaagatggc 


900 


attcttggat 


ggtaatcctc 


ctgaaaggtt 


atgcatgcct 


attgttgacc 


atgttcgctc 


960 


tttgggtggt 


gaggttcggc 


tgaattctcg 


tattcagaaa 


atagaactta 


atcctgatgg 


1020 


aacagtgaaa 


cactttgcac 


ttaccgatgg 


aactcaaata 


actggagatg 


cttatgtttt 


1080 


tgcaacacca 


gttgatatct 


tgaagcttct 


tgtacctcaa 


gagtggaaag 


aaatatctta 


1140 


tttcaagaag 


ctggagaagt 


tggtgggagt 


tcctgttata 


aatgttcata 


tatggtttga 


1200 


tagaaaactg 


aagaacacat 


atgaccacct 


tcttttcagc 


aggagttcac 


ttttaagtgt 


1260 


ttatgcggac 


atgtcagtaa 


cttgcaagga 


atactatgat 


ccaagccgtt 


caatgctgga 


1320 


gttggtcttt 


gctcctgcag 


aggaatgggt 


tggacggagt 


gacactgaaa 


tcatcgaagc 


1380 


aactatgcaa 


gagctagcca 


agctatttcc 


tgatgaaatt 


gctgctgatc 


agagtaaagc 


1440 


aaagattctg 


aagtatcatg 


ttgtgaagac 


accaagatct 


gtttacaaga 


ctatcccgga 


1500 


ctgtgaacct 


tgccgacctc 


tgcaaagatc 


accgattgaa 


gggttctatc 


tagctggtga 


1560 


ctacacaaag 


cagaaatatt 


tggcttcgat 


ggagggtgca 


gttctatctg 


ggaagctttg 


1620 


tgctcagtct 


gtagtggagg 


attataaaat 


gctatctcgt 


aggagcctga 


aaagtctgca 


1680 


gtccgaagtt 


cctgttgcct 


cctagttgta 


gtcaggacta 


ttcccaatgg 


tgtgtgtgtc 


1740 


atcatcccct 


agtcagtttt 


tttctattta 


gtgggtgccc 


aactctccac 


caatttacac 


1800 


atgatggaac 


ttgaaagatg 


cctattttgg 


tcttatcata 


tttctgtaaa 


gttgatttgt 


1860 


gactgagagc 


tgatgccgat 


atgccacgct 


ggagaaaaag 


aacattatgt 


aaaacgacct 


1920 


gcatagtaat 


tcttagactt 


ttgcaaaagg 


caaaaggggt 


aaagcgacct 


tttttttcta 


1980 


tgtgaaggga 


ttaagagacc 


ttaaaaaaaa 


aaaaaaaaaa 


aaaaaaa 




2027 
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<210> 31 
<2X1> 1931 
<212> DNA 

<213> Lycopersicon esculentum 



<400> 31 

ttttgtcttt ctttcttgtt aacccatttt cttgatattt aacaagaaaa gtttctttct 60 

tttttttcct accctcataa ttgggtagag aacaattccc atggctactt cttcagctta 120 

tctttcttgt cctgcaactt ctgctactgg aaagaaacat gttttcccaa atgggtcacc 180 

tggattcttg gtttttggtg gtacccgttt gtccaaccgg ttagtgaccc gaaagtcggt 240 

tattcgggct gatttggatt ctatggtttc tgatatgagt accaacgctc caaaagggct 300 

atttccaccc gagcctgaac. attatcgggg gccaaagctg aaagtagcta ttattggagc 360 

tgggcttgca ggcatgtcga ctgctgtgga gctcttggat caaggacatg aggtggatat 420 

atacgaatca aggactttta ttggtgggaa agtgggttct tttgttgata gacgtgggaa 4 80 

ccacattgaa atgggactgc acgtgttctt tggttgttat aataatctgt tccgtctgtt 540 

gaaaaaggtg ggtgctgaaa aaaatctgct agtgaaggag catactcaca catttgtaaa 600 

taaagggggt gaaatagggg aacttgattt ccgctttcca gttggagcac ccttacatgg 660 

aattaatgca tttctgtcta ctaatcagtt aaagatttat gataaagcta gaaatgctgt 720 

agctcttgcc cttagtccag tggtgcgggc tttagttgat ccggatggtg cattgcagca 780 

gatacgcgat ctagataatg taagcttttc tgagtggttt ctgtctaaag gtgggacgcg 840 

tgctagcatc cagaggatgt gggatcctgt tgcatatgcfc cttggattca ttgactgtga 900 

taacatgagt gctcggtgta tgctcactat atttgcatta tttgccacaa aaacagaggc 960 

ttccctatta cgcatgctta aaggttctcc tgacgtttat ttgagtggtc caattaagaa 1020 

gtacatcatg gacaaagggg gcaggttcca tctgaggtgg ggatgcagag aggtactcta 1080 

tgagacgtcc tctgatggaa gcatgtatgt tagtgggctt gccatgtcaa aggccactca 1140 

gaagaaaatt gtaaaagctg atgcatatg.t ggctgcatgt gatgtccctg gaattaaaag 12 00 

attggttcct cagaagtgga gggaattgga attctttgac aacatttaca aattggtcgg 1260 

agtgcctgtt gttaccgtac aactacgcta caatggctgg gttacagagt tgcaggactt 1320 

ggagcgttcg aggcaattga agcgcgctgc aggattggac aatctcctct atacgccaga 13 80 

tgcagatttc tcttgctttg cagatcttgc attggcatct ccagatgatt actacattga 1440 
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gggacaaggc tcattgcttc aatgtgtcct tacacctggt gacccttaca tgcctctatc 1500 

aaatgatgaa atcattaaaa gagttacaaa gcaggttttg gcattatttc cttcgtccca 1560 

aggtcttgag gttacctggt catcagtttt gaagatagga caatctttat atcgtgaagg 1620 

acctggtaaa gacccattca gacctgatca gaagacgcca gtggaaaatt tctttcttgc 1680 

tggctcatat acaaaacagg actacatcga tagcatggaa ggagcaactc tttcaggtag 1740 

gcaagcttct gcatacatat gtaatgttgg agagcagctg atggcgttgc gtaaaaagat 1800 

cactgctgct gagttgaatg acatctctaa aggtgtgtcc ctatctgatg agttgagtct 1860 

tgtctgatga cagactgcaa atcatccaaa t.acaactcag ttaggcatcg cacaaggaag 192 0 

aattcttcta a X931 

<210> 32 
<211> 1982 
<212> DNA 

<213> Capsicum annuurn 



<400> 32 
cacaattcta 


tggctacttg 


ttcagcttat 


ctttgttgtc ctgccacttc 


tgcttcttta 


60 


aagaaacgtg 


tfctttccaga 


tgggtccgct 


ggattcttgt tttttggtgg 


tcgtcgtttg 


12 0 


tcgaaccggt 


tagtgacccc 


aaagtctgtc 


atccgagctg atttgaactc 


catggtctct 


180 


gacatgagta 


ccaacgctcc 


aaaagggcta 


tttccacctg aacctgaaca 


ttatcggggg 


240 


ccaaagctga 


aagtagctat 


tattggagct 


ggccttgcag gcatgtcgac 


tgctgtggag 


300 


ctcttggatc 


aaggacatga 


ggtggatata 


tatgaatcaa ggaccttcat 


tggtgggaaa 


360 


gtgggttctt 


ttgttgataa 


acgtgggaac 


cacattgaaa tgggactgca 


cgtgttcttt 


420 


ggttgctata 


ataatctatt 


ccgtctgatg 


aaaaaggtgg gtgctgaaaa 


aaatctgcta 


480 


gtgaaggagc 


atactcacac 


atttgtaaat 


aaagggggtg aaatagggga 


gcttgatttc 


540 


cgctttccag 


ttggagcgcc 


cttacatgga 


attaatgcat ttttgtctac 


taatcaacta 


600 


aagacttatg 


ataaagctag 


aaatgctgta 


gctcttgccc ttagtccagt 


ggtgcgggct 


660 


ttagttgatc 


cagatggcgc 


attgcagcag 


atacgtgatc tagatagtgt 


aagcttttct 


720 


gattggttta 


tgtctaaagg 


a ggg ac g c gc 


gctagcatcc agaggatgtg 


ggatcctgtt 


780 


gcatatgctc 


ttggattcat 


tgactgtgac 


aatatcagtg ctcggtgtat 


gctcactata 


840 


tttgcattat 


ttgccactaa 


aacggaggct 


tccctactgc gcatgcttaa 


aggttctcct 


900 
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gacgtttatt tgagtggtcc aattaagaag tacatcatag acaagggggg aaggttccat 



960 



ctgaggtggg gatgcagaga ggtactctac gagacatcct ctgatggaag catgtatgtt 1020 

agcgggcttg ccatgtcaaa ggccactcag aagaaaattg taaaagctga tgcctatgtt 1080 

gccgcatgtg tagtacctgg aattaaaaga ttagtacctc agaagtggag ggaattggaa 1140 

ttctttggca acatttacaa actgattgga gtgcctgttg ttactgtgca actacgatac 12 00 

aatggctggg ttacggagtt gcaggacttg gagcgttcaa ggcaatcaaa gcgcgctaca 12 60 

ggtttggaca atctcctgta cacgccagat gcagatttct cttgttttgc agaccttgca 1320 

ttggcatctc cagaagatta ttacattgag ggacaaggct cgttgcttca atgtgtcctt 13 80 

acgcctggcg acccttacat gcctctacca aatgaagaaa tcataagaag agtgtcaaag 1440 

caggttttgg cgttatttcc ttcttcccaa ggtcttgagg taacctggtc atcagttgtg 1500 

aagattgggc aatccttata tcgtgaagga cctggtaaag acccgttcag acctgatcaa 1560 

aagacgccag tggaaaattt ctttcttgct ggctcatata caaaacagga ctacatcgat 1620 

agtatggaag gggcaactct ttcaggcaga caagcttctg catacatatg tgatgctgga 1680 

gagcagctgt tggcgctgcg aaaaaagatt gctgctgctg agttaaacga gatctctaaa 1740 

ggtgtatcgc tatcggatga gttgagtctt gtctgatgac tgcaaatcat tcagaaatat 1800 

aattcagtta ggcagtgcat aaggaagaat tcttctaaat ttttgagtct cacaattatg 1860 

gaaatcaaaa tatgttttaa aaatgttgta tgtatgtaat attagtaaat cttcatagtg 1920 

atgtatgtat ctattctgcc acgcttcagt ttagtgaaat ggaacttatt gctgcatcaa 1980 

tc 1982 

<210> 33 
<211> 2265 
<212> DNA 
<213> Zea mays 



<400> 33 

ccctgccacg acgcccgcga caaatccctg cgcgacggca tcttcgcctc ccatcccctc 



60 



ccagcttccc ctcccactcc ggccctcaca caaattgccc ctcttcttct cctcctcttt 



120 



acacgctgcc gaccacggct gccgccaacc acccgcccca cccgtccacc gctgccgagt 



180 



gctagccatt tggagctgcc gcgccatggc gtccgtggcc gccaccacca cgctggcacc 



240 



ggcactcgcc ccgcgccggg cgcggccagg gactgggctc gtgccgccgc gccgggcctc 



300 
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ggccgtcgct 


gctcgctcga 


ccgtaacgtc 


tccgacatgg 


cgtcaacgct 


cccaaaggtt 


360 


attcccaccc 


gagccagagc 


actacagggg 


cccgaagctc 


aaggtggcca 


tcataggggc 


420 


aggccttgcg 


ggcatgtcca 


ccgctgttga 


gctcttggac 


cagggccatg 


aggttgattt 


480 


gtacgagtcc 


cgtccgttta 


tcggtggcaa 


ggttggctcc 


tttgttgaca 


ggcaaggaaa 


540 


ccatatcgag 


atggggctgc 


atgtgttctt 


cgggtgctac 


agcaatctct 


tccgcctcat 


600 


gaagaaggtt 


ggcgctgata 


ataatctgct 


ggtgaaggaa 


catacccata 


cttttgtaaa 


660 


taaagggggc 


acgattggtg 


aacttgattt 


tcggttcccg 


gtgggagctc 


cgttacatgg 


720 


cattcaagca 


ttcctaagaa 


ctaatcagct 


caaggtttat 


gataaagcaa 


gaaatgcagt 


780 


tgctcttgcc 


cttagtccag 


ttgttcgggc 


tctggttgat 


cctgatggtg 


cattgcagca 


840 


agtgcgggac 


ttggatgata 


taagtttcag, 


tgattggttc 


atgtccaaag 


ggggtactcg 


900 


ggagagtatc 


acaagaatgt 


gggatc'ctgt 


tcgttacgct 


ttgggtttca 


ttgactgtga 


960 


taatatcagt 


gcacgttgca tgcttactat tttcaccttg tttgccacaa 


agacagaggc 


1020 


atccctgtta 


cgcatgttaa 


agggttcacc 


tgatgtttac 


ttaagtggtc 


caataaagaa 


1080 


gtatataaca 


gacaggggtg 


gtaggtttca 


cttaaggtgg 


ggatgcagag 


aggttctcta 


1140 


tgagaagtca 


cctgatggag agacctatgt taagggcctt 


ctactcacca 


aggctacaag 


1200 


tagagagata 


atcaaagctg 


atgcatacgt 


cgcagcctgt 


gatgttccag 


gtatcaaaag . 


1260 


attacttcca 


tcagaatgga 


gggagtggga 


aatgtttgac 


aatatctaca 


agttagatgg 


1320 


tgtccctgtt 


gtcactgtcc 


agctccgcta 


caacggatgg 


gtcactgaac 


ttcaagattt 


1380 


ggagaaatca 


agacaactgc 


aaagggcggt 


tgggttggat 


aaccttttgt 


acacggcgga 


1440 


tgcagacttt 


tcctgttttt 


cggaccttgc 


tctctcatct 


cctgctgatt 


actacattga 


1500 


agggcaaggt 


tccctgatcc 


aagctgtgct 


gactcctgga gatccataca 


tgccattgcc 


1560 


aaacgag'gag 


atcattagta 


aggttcaaaa 


gcaggttgta 


gaactgttcc 


catcttcccg 


1620 


gggcttagaa 


gttacatggt 


ccagtgtggt 

i 


aaagatcgga 


caatcgctgt 


accgtgaggc 


1680 


tcctggaaac 


gacccattca ggcctgatca gaagacgccc gttaaaaact 


tcttcctctc 


1740 


tggatcttac 


acgaaacagg 


actacatcga 


cagcatggaa 


ggagcaactc 


tctccggcag 


1800 


gcgaacgtcg 


gcctacatct 


gcggtgccgg 


ggaggagctg 


ctggccctcc 


gaaagaagct 


1860 


actcatcgac 


gacggcgaga 


aggcgctggg 


gaacgttcaa 


gtcctgcagg 


ctagctgaac 


1920 


aacccctcct 


gcactgcaga 


gaagcttgga 


tctttccaac 


cacacataca 


tgctggaatg 


1980 


gacaaaccaa 


ccaaccattg 


tcttttctcg 


cttcagggtg 


ctggcgattc 


ccgcagcaac 


2040 


ctcctgtgta 


tcgtatccaa 


tttgagcatt 


agatctgccc 


cccccccctg 


caggcgtttc 


2100 
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tttcctatcc ctgatccgag aagcagggtg tagtctaggt ggctggcata cgggattaca. 2160 

tcaggcagtg tgtaagttca gctggaactc gattggtaat tgggatggat gattgatgat 2220 

atafcatatag cacacactgt tcttgcgtct tgcaaaaaaa aaaaa 2265 

<210> 34 
<211> 1632 
<212> DNA 
<213> Oryza sp. 

<400> 34 

cccacgcgtc cgcccacgcg tccggattgg tgaacttgat tttcggtttc ctgtgggagc 60 

tccgttacat ggtatccaag cattcctacg aactaaccaa ctcaaggttt atgataaagc 120 

aagaaatgcc gttgctcttg ctctaagccc agttgttcga gctcttgttg atccagatgg 180 

tgcattgcag caagtacggg atttggatga tgtaagtttc agcgattggt tcttgtcgaa 24 0 

aggtggtact cgagagagca tcacaaggat gtgggatcct gttgcctatg ctcttggttt 3 00 

cattgactgt gataatatca gtgcacgttg catgcttacc attttcactc tgtttgccac 3 60 

aaaaacagag gcatctttat tacgcatgct aaagggttca cctgatgttt atctgagtgg 420 

tccaataaag aagtacataa cagacagggg tggtaggttt cacctgaggt ggggatgtag 480 

ggaggttctc tatgataagt cacctgatgg ggaaacctat gttaaaggcc ttctcctatc 540 

caaggctaca agtagagaga taatcaaagc agatgcatat gtcgcagctt gtgatgtccc 600 

ggggatcaaa agacttttac cttctgaatg gaggcaatgg gatacatttg acaacatcta 660 

caagttagat ggtgttcctg tagtcacagt acagcttcgt tataatggat gggttacaga 720 

acttcaagat ttggagaaat caagacaact gaaaaaggca gttggcttgg ataatcttct 780 

ctacactcca gatgcagatt tttcatg'ttt ttcagacctt gcactttcat ctcctgctga 840 

ctactacatt gaaggacaag gttccttgat ccaagctgtg ctaacccctg gcgatcctta 900 

catgccattg ccgaatgagg agataattag caaggttcaa aagcaggtct tagaattgtt 960 

cccgtcatca caaggcttgg aacttacatg gtcgagtgtg gtgaaaatcg gtcaatcatt 1020 

gtaccgcgag tcaccaggaa atgatccatt tagacctgat caaaagacac cagttaaaaa 1080 

cttcttcctg tctggctctt acacaaaaca ggactacatt gacagcatgg aaggggcaac 1140 

tctctcaggc aggagaaccg cggcctacat ctgtggtgca ggagaggagc tgcttcgccc 120 0 

tccgaaagaa gctcattgtc gacgacagcg gagaaggcca ggggtaaggt cgacggccct 1260 
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tcagacaagc tgagcttcct caaatgacac atgctggagt gagtggattg ctatgcccaa 1320 

aacaaaaaca gcttcctggg tgtagtaggc gatttccgca gcgactctca tgtaaatcct 1380 

acttgattga gcatttaggt ccaatctgct gctgcccttt ttgccttgac acgatcgttc 1440 

gttcgcccgt caatggtgtg ttcttcgtta ttgtgaattt gtgattggga accaaaggtg 1500 

gcatacggga ttacatcagg cagcgtgtgt tttgttcagc ttaaccgatc attgaaccca 1560 

ttgatgatga tgatgatgtt tatatagtgc acacatcact taaaaaaaaa aaaaaaaaaa 162 0 

aaaaaaaaaa aa 1632 

<210> 35 

<211> 40 

<212> DNA 

<213> PRIMER 



<400> 35 

cgtcggcctg catggcccta cttctggcta tttctcagtg 4 0 

<210> 36 

<211> 26 

<212> DNA 

<213> PRIMER 



<400> 36 

ctgtccatgg cggccatcac gctcct 26 

<210> 37 

<211> 40 

<212> DNA 

<213> PRIMER 



<400> 37 

cgatggcctg catggcccta ggtctggcca tttctcaatg . 40 
<210> 38 
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<211> 32 
<212> DNA ; 

<213> PRIMER 



<400> 38 

taggataaga tagcaaatcc atggccatca ta 



32 
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<400> 2 

gttaatcatg gtgtaggcaa cccaaataaa acaccaaaat atgcacaagg cagtttgttg 60 

tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt agaaaaggaa 120 

acaatatcat gagtaatgtg^ tgagcattat gggaccacga aataaaaaga acattttgat 180 

gagtcgtgta tcctcgatga gcctcaaaag ttctctcacc ccggataaga aacccttaag 24 0 

caatgtgcaa agtttgcatt ctccactgac ataatgcaaa ataagatatc atcgatgaca 3 00 

tagcaactca tgcatcatat catgcctctc tcaacctatt cattcctact catctacata 360 

agtatcttca gctaaatgtt agaacataaa cccataagtc acgtfctgafcg agtattaggc 42 0 

gtgacacatg acaaatcaca gactcaagca agataaagca aaatgatgtg tacataaaac 480 

tccagagcta tatgtcatat tgcaaaaaga ggagagctta taagacaagg catgactcac 54 0 

aaaaattcat ttgcctttcg tgtcaaaaag aggagggctt tacattatcc atgtcatatt 600 

gcaaaagaaa gagagaaaga acaacacaat gctgcgtcaa ttatacatat ctgtatgtcc 660 

atcattattc atccaccttt cgtgtaccac acttcatata tcatgagtca cttcatgtct 720 

ggacattaac aaactctatc ttaacattta gatgcaagag cctttatctc actataaatg 78 0 

cacgatgatt tctcattgtt tctcacaaaa agcattcagt tcattagtcc tacaacaacg 84 0 

aattcggctt cccgggtaca gggtaaattt ctagtttttc tccttcattt tcttggttag 90 0 

gacccttttc tctttttatt tttttgagct ttgatctttc tttaaactga tctatttttt 960 

aattgattgg ttatcgtgta aatattacat agctttaact gataatctga ttactttatt 1020 

tcgtgtgtct ttgatcatct tgatagttac agaaccgtcg actctagaga agccatttaa 1080 

atcgccgcca ccatggcttc tatgatatcc tcttccgctg tgacaacagt cagccgtgcc 114 0 

tctagggggc aatccgccgc agtggctcca ttcggcggcc tcaaatccat gactggattc 12 0 0 

ccagtgaaga aggtcaacac tgacattact tccattacaa gcaatggtgg aagagtaaag 126 0 

tgcatgaaac caactacggt aattggtgca ggcttcggtg gcctggcact ggcaattcgt 132 0 

ctacaagctg cggggatccc cgtcttactg cttgaacaac gtgataaacc cggcggtcgg 1380 

gcttatgtct acgaggatca ggggtttacc tttgatgcag gcccgacggt tatcaccgat 144 0 

cccagtgcca ttgaagaact gtttgcactg gcaggaaaac agttaaaaga gtatgtcgaa 150 0 

ctgctgccgg ttacgccgtt ttaccgcctg tgttgggagt cagggaaggt ctttaattac 1560 

gataacgatc aaacccggct cgaagcgcag attcagcagt ttaatccccg cgatgtcgaa 1620 

ggttatcgtc agtttctgga ctattcacgc gcggtgttta aagaaggcta tctgaagctc 1680 

ggtactgtcc cttttttatc gttcagagac atgcttcgcg ccgcacctca actggcgaaa 1740 
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ctgcaggcat ggagaagcgt ttacagtaag gttgccagtt acatcgaaga tgaacatctq 

1800 

cgccaggcgt tttctttcca ctcgctgttg gtgggcggca atcccttcgc cacctcatcc 1860 
atttatacgt tgatacacgc gctggagcgt gagtggggcg tctggtttcc gcgtggcggc 192 0 
accggcgcat tagttcaggg gatgataaag ctgtttcagg atctgggtgg cgaagtcgtg 1980 
ttaaacgcca gagtcagcca tatggaaacg acaggaaaca agattgaagc cgtgcattta 2040 
gaggacggtc gcaggttcct gacgcaagcc gtcgcgtcaa atgcagatgt ggttcatacc 2100 
tatcgcgacc tgttaagcca gcaccctgcc gcggttaagc agtccaacaa actgcagact 2160 
aagcgcatga gtaactctct gtttgtgctc tattttggtt tgaatcacca tcatgatcag 2220 
ctcgcgcatc acacggtttg tttcggcccg cgttaccgcg agctgattga cgaaattttt 2280 
aatcatgatg gcctcgcaga ggacttctca ctttatctgc acgcgccctg tgtcacggat 2340 
tcgtcactgg cgcctgaagg ttgcggcagt tactatgtgt tggcgccggt gccgcattta 24 00 
ggcaccgcga acctcgactg gacggttgag gggccaaaac tacgcgaccg tatttttgcg 2460 
taccttgagc agcattacat gcctggctta cggagtcagc tggtcacgca ccggatgttt 2520 
acgccgtttg attttcgcga ccagcttaat gcctatcatg gctcagcctt ttctgtggag 2580 
cccgttctta cccagagcgc ctggtttcgg ccgcataacc gcgataaaac cattactaat 2640 
ctctacctgg tcggcgcagg cacgcatccc ggcgcaggca ttcctggcgt catcggctcg 2700 
gcaaaagcga cagcaggttt gatgctggag gatctgattt gaggccatgc aggccgatcc 2760 
ccgatcgttc aaacatttgg caataaagtt tcttaagatt gaatcctgtt gccggtcttg 2820 
cgatgattat catataattt ctgttgaatt acgttaagca tgtaataatt aacatgtaat 2880 
gcatgacgtt atttatgaga tgggttttta tgattagagt cccgcaatta tacatttaat 2940 
acgcgataga aaacaaaata tagcgcgca'a actaggataa attatcgcgc gcggtgtcat 3000 
ctatgttact agatcgggcc ttaataagct tgttaatcat ggtgtaggca acccaaataa 3 060 
aacaccaaaa tatgcacaag gcagtttgtt gtattctgta gtacagacaa aactaaaagt 3120 
aatgaaagaa gatgtggtgt tagaaaagga aacaatatca tgagtaatgt gtgagcatta 3180 
tgggaccacg aaataaaaag aacattttga tgagtcgtgt atcctcgatg agcctcaaaa 3240 
gttctctcac cccggataag aaacccttaa gcaatgtgca aagtttgcat tctccactga 3300 
cataatgcaa aataagatat catcgatgac atagcaactc atgcatcata tcatgcctct 3360 
ctcaacctat tcattcctac tcatctacat aagtatcttc agctaaatgt tagaacataa 3420 
acccataagt cacgtttgat gagtattagg cgtgacacat gacaaatcac agactcaagc 3480 
aagataaagc aaaatgatgt gtacataaaa ctccagagct atatgtcata ttgcaaaaag 3540 
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aggagagctt 


ataagacaag 


gcatgactca 


caaaaattca 


tttgcctttc 


gtgtcaaaaa 


3600 


gaggagggct 


ttacattatc 


catgtcatat 


tgcaaaagaa 


agagagaaag 


aacaacacaa 


3660 


tgctgcgtca 


attatacata 


tctgtatgtc 


catcattatt 


catccacctt 


tcgtgtacca 


3720 


cacttcatat 


atcatgagtc 


acttcatgtc 


tggacattaa 


caaactctat 


cttaacattt 


3780 


agatgcaaga 


gcctttatct 


cactataaat 


gcacgatgat 


ttctcattgt 


ttctcacaaa 


3840 


aagcattcag 


ttcattagtc 


ctacaacaac 


gaattcggct 


tcccgggtac 


agggtaaatt 


3900 


tctagttttt 


ctccttcatt 


ttcttggtta 


ggaccctttt 


ctctttttat 


ttttttgagc 


3960 


tttgatcttt 


ctttaaactg 


atctattttt 


taattgattg 


gttatcgtgt 


aaatattaca 


4020 


tagctttaac 


tgataatctg 


attactttat 


ttcgtgtgtc 


tttgatcatc 


ttgatagtta 


4080 


cagaaccgtc 


gactctagag 


aagccattta 


aatcgccgcc 


accatggcca 


tcatactcgt 


4140 


acgagcagcg 


tcgccggggc 


tctccgccgc 


cgacagcatc 


agccaccagg 


ggactctcca 


4200 


gtgctccacc 


ctgctcaaga 


cgaagaggcc 


ggcggcgcgc 


cggtggatgc 


cctgctcgct 


4260 


ccttggcctc 


cacccgtggg 


aggctggccg 


tccctccccc 


gccgtctact 


ccagcctcgc 


4320 


cgtcaacccg 


gcgggagagg 


ccgtcgtctc 


gtccgagcag 


aaggtctacg 


acgtcgtgct 


4380 


caagcaggcc 


gcattgctca 


aacgccagct 


gcgcacgccg 


gtcctcgacg 


ccaggcccca 


4440 


ggacatggac 


atgccacgca 


acgggctcaa 


ggaagcctac 


gaccgctgcg 


gcgagatctg 


4500 


tgaggagtat 


gccaagacgt 


tttacctcgg 


aactatgttg 


atgacagagg 


agcggcgccg 


4560 


cgccatatgg 


gccatctatg 


tgtggtgtag 


gaggacagat 


gagcttgtag 


atgggccaaa 


4620 


cgccaactac 


attacaccaa 


cagctttgga 


ccggtgggag 


aagagacttg 


aggatc.tgtt 


4680 


cacgggacgt 


Ccttacgaca 


tgcttgatgc 


cgctctctct 


gataccatct 


caaggttccc 


474 0 


catagacatt cagccattca 


gggacatgat 


tgaagggatg 


aggagtgatc 


ttaggaagac 


4800 


aaggtataac 


aacttcgacg 


agctctacat 


gtactgctac 


tatgttgctg 


gaactgtcgg 


4860 


gttaatgagc 


gtaccagtga 


tgggcatcgc 


atccgagtct 


aaagcaacaa 


ctgaaagcgt 


4920 


gtacagtgct 


gccttggctc 


tcggaattgc 


gaaccaactc 


acgaacatac 


tccgggatgt 


4980 


tggagaggat 


gctagacgag 


gaaggatata 


tttaccacaa 


gatgagcttg 


cacaggcagg 


5040 


gctctctga t 


gaggacatct 


tcaaaggggt 


cgtcacgaac 


cggtggagaa 


acttcatgaa 


5100 


gaggcagatc 


aagagggcca 


ggatgttttt 


tgaggaggca 


gagagagggg 


taactgagct 


5160 


ctcacaggct 


agcagatggc 


cagtatgggc 


ttccctgttg 


ttgtacaggc 


agatcctgga 


5220 


tgagatcgaa 


gccaacgact 


acaacaactt 


cacgaagagg 


gcgtatgttg 


gtaaagggaa 


5280 


gaagttgcta 


gcacttcctg 


tggcatatgg 


aaaatcgcta 


ctgctcccat 


gttcattgag 


5340 
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aaatggccag acctagggcc atgcaggccg atccccgatc gttcaaacat ttggcaataa 5400 

agtttcttaa gattgaatcc tgttgccggt cttgcgatga ttatcatata atttctgttg 5460 

aattacgtta agcatgtaat aattaacatg taatgcatga cgttatttat gagatgggtt 552 0 

tttatgatta gagtcccgca attatacatt taatacgcga tagaaaacaa aatatagcgc 5580 

gcaaactagg ataaattatc gcgcgcggtg tcatctatgt tactagatcg 5630 

<210> 3 

<211> 5180 

<212> DNA 

<213> SYNTHETIC - 12422 



<400> 3 

gttaatcatg gtgtaggcaa cccaaataaa acaccaaaat atgcacaagg cagtttgttg 60 

tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt agaaaaggaa 12 0 

acaatatcat gagtaatgtg tgagcattat gggaccacga aataaaaaga acattttgat 18 0 

gagtcgtgta tcctcgatga gcctcaaaag ttctctcacc ccggataaga aacccttaag 240 

caatgtgcaa agtttgcatt ctccactgac ataatgcaaa ataagatatc atcgatgaca 300 

tagcaactca tgcatcatat catgcctct'c tcaacctatt cattcctact catctacata 360 

agtatcttca gctaaatgtt agaacataaa cccataagtc acgtttgatg agtattaggc 420 

gtgacacatg acaaatcaca gactcaagca agataaagca aaatgatgtg tacataaaac 480 

tccagagcta tatgtcatat tgcaaaaaga ggagagctta taagacaagg catgactcac 540 

aaaaattcat ttgcctttcg tgtcaaaaag aggagggctt tacattatcc atgtcatatt 600 

gcaaaagaaa gagagaaaga acaacacaat gctgcgtcaa ttatacatat ctgtatgtcc 660 

atcattattc atccaccttt cgtgtaccac acttcatata tcatgagtca cttcatgtct 720 

ggacattaac aaactctatc ttaacattta gatgcaagag cctttatctc actataaatg 780 

cacgatgatt tctcattgtt tctcacaaaa agcattcagt tcattagtcc tacaacaacg 840 

aattcggctt cccaaatcgc cgccaccatg gcttctatga tatcctcttc cgctgtgaca 900 

acagtcagcc gtgcctctag ggggcaatcc gccgcagtgg ctccattcgg cggcctcaaa 960 

tccatgactg gattcccagt gaagaaggtc aacactgaca ttacttccat tacaagcaat 1020 

ggtggaagag taaagtgcat gaaaccaact acggtaattg gtgcaggctt cggtggcctg 1080 

gcactggcaa ttcgtctaca agctgcgggg atccccgtct tactgcttga acaacgtgat 1140 
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aaacccggcg gtcgggctta tgtctacgag gatcaggggt ttacctttga tgcaggcccg 1200 
acggttatca ccgatcccag tgccattgaa gaactgtttg cactggcagg aaaacagtta 1260 
aaagagtatg tcgaactgct gccggttacg ccgttttacc gcctgtgttg ggagtcaggg 1320 
aaggtcttta attacgataa cgatcaaacc cggctcgaag cgcagattca gcagtttaat 1380 
ccccgcgatg tcgaaggtta tcgtcagttt ctggactatt cacgcgcggt gtttaaagaa 1440 
ggctatctga agctcggtac tgtccctttt ttatcgttca gagacatgct tcgcgccgca 1500 
cctcaactgg cgaaactgca ggcatggaga agcgtttaca gtaaggttgc cagttacatc 1560 
gaagatgaac atctgcgcca ggcgttttct ttccactcgc tgttggtggg cggcaatccc 1620 
ttcgccacct catccattta tacgttgata cacgcgctgg agcgtgagtg gggcgtcfcgg 168 0 
tttccgcgtg gcggcaccgg cgcattagtt caggggatga taaagctgtt tcaggatctg 1740 
ggtggcgaag tcgtgttaaa cgccagagtc agccatatgg aaacgacagg aaacaagatt 1800 
gaagccgtgc atttagagga cggtcgcagg ttcctgacgc aagccgtcgc gtcaaatgca 1860 
gatgtggttc atacctatcg cgacctgtta agccagcacc ctgccgcggt taagcagtcc 1920 

aacaaactgc agacfcaagcg catgagtaac tctctgtttg tgctctattt tggtttgaat 1980 

caccatcatg atcagctcgc gcatcacacg gtttgtttcg gcccgcgtta ccgcgagctg 2 040 

attgacgaaa tttttaatca tgatggcctc gcagaggact tctcacttta tctgcacgcg 2100 

ccctgtgtca cggattcgtc actggcgcct gaaggttgcg gcagttacta tgtgttggcg 2160 

ccggtgccgc atttaggcac cgcgaacctc gactggacgg ttgaggggcc aaaactacgc 2220 * 

gaccgtattt ttgcgtacct tgagcagcat tacatgcctg gcttacggag tcagctggtc 22 80 

acgcaccgga tgtttacgcc gtttgatttt cgcgaccagc ttaatgccta tcatggctca 2340 

gccttttctg tggagcccgt tcttacccag agcgcctggt ttcggccgca taaccgcgat 2400 

aaaaccatta ctaatctcta cctggtcggc gcaggcacgc atcccggcgc aggcattcct 2460 

ggcgtcatcg gctcggcaaa agcgacagca ggtttgatgc tggaggatct gatttgaggc 252 0 

catgcaggcc gatccccgat cgttcaaaca tttggcaata aagtt'tctta agattgaatc 2580 

ctgttgccgg tcttgcgatg attatcatat aatttctgtt gaattacgtt aagcatgtaa 2640 

taattaacat gtaatgcatg acgttattta tgagatgggt ttttatgatt agagtcccgc 2700 

aattatacat ttaatacgcg atagaaaaca aaatatagcg cgcaaactag gataaattat 2760 

cgcgcgcggt gtcatctatg ttactagatc gggccttaat aagcttgtta atcatggtgt 2820 

aggcaaccca aataaaacac caaaatatgc acaaggcagt ttgttgtatt ctgtagtaca 2880 

gacaaaacta aaagtaatga aagaagatgt ggtgttagaa aaggaaacaa tatcatgagt 294 0 
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aatgtgtgag cattatggga ccacgaaata aaaagaacat tttgatgagt cgtgtatcct 3 000 

cgatgagcct caaaagttct ctcaccccgg ataagaaacc cttaagcaat gtgcaaagtt 3 060 

tgcattctcc actgacataa tgcaaaataa gatatcatcg atgacatagc aactcatgca 312 0 

tcatatcatg cctctctcaa cctattcatt cctactcatc tacataagta tcttcagcta 3180 

aatgttagaa cataaaccca taagtcacgt ttgatgagta ttaggcgtga cacatgacaa 324 0 

atcacagact caagcaagat aaagcaaaat gatgtgtaca taaaactcca gagctatatg 3300 

tcatattgca aaaagaggag agcttataag acaaggcatg actcacaaaa attcatttgc 3360 

ctttcgtgtc aaaaagagga gggctttaca ttatccatgt catattgcaa aagaaagaga 3420 

gaaagaacaa cacaatgctg cgtcaattat acatatctgt atgtccatca ttattcatcc 3480 

acctttcgtg taccacactt catatatcat gagtcacttc atgtctggac attaacaaac 3540 

tctatcttaa catttagatg caagagcctt tatctcacta taaatgcacg atgatttctc 3600 

attgtttctc acaaaaagca ttcagttcat tagtcctaca acaacgaatt cggcttccca 3660 

aatcgccgcc accatggcca tcatactcgt acgagcagcg tcgccggggc tctccgccgc 3720 

cgacagcatc agccaccagg ggactctcca gtgctccacc ctgctcaaga cgaagaggcc 3780 

ggcggcgcgc cggtggatgc cctgctcgct ccttggcctc cacccgtggg aggctggccg 3840 

tccctccccc gccgtctact ccagcctcgc cgtcaacccg gcgggagagg ccgtcgtctc 3900 

gtccgagcag aaggtctacg acgtcgtgct caagcaggcc gcattgctca aacgccagct 3 960 

gcgcacgccg gtcctcgacg ccaggcccca ggacatggac atgccacgca acgggctcaa 4020 

ggaagcctac gaccgctgcg gcgagatctg tgaggagtat gccaagacgt tttacctcgg 4080 

aactatgttg atgacagagg agcggcgccg cgccatatgg gccatctatg tgtggtgtag 4140 

gaggacagat gagcttgtag atgggccaaa cgccaactac attacaccaa cagctttgga 42 00 

ccggtgggag aagagacttg aggatctgtt cacgggacgt ccttacgaca tgcttgatgc 42 60 

cgctctctct gataccatct caaggttccc catagacatt cagccattca gggacatgat 432 0 

tgaagggatg aggagtgatc ttaggaagac aaggtataac aacttcgacg agctctacat 43 80 

gtactgctac tatgttgctg gaactgtcgg gttaatgagc gtaccagtga tgggcatcgc 444 0 

atccgagtct aaagcaacaa ctgaaagcgt gtacagtgct gccttggctc hcggaattgc 4 500 

gaaccaactc acgaacatac tccgggatgt tggagaggat gctagacgag gaaggatata 4560 

tttaccacaa gatgagcttg cacaggcagg gctctctgat gaggacatct tcaaaggggt 4620 

cgtcacgaac cggtggagaa acttcatgaa gaggcagatc aagagggcca ggatgttttt 4680 

tgaggaggca gagagagggg taactgagct ctcacaggct agcagatggc cagtatgggc 4 74 0 
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ttccctgttg ttgtacaggc agatcctgga tgagatcgaa gccaacgact acaacaactt 4 8.00 

cacgaagagg gcgtatgttg gtaaagggaa gaagttgcta gcacttcctg tggcatatgg 4860 

aaaatcgcta ctgctcccat gttcattgag aaatggccag acctagggcc atgcaggccg 4320 

atccccgatc gttcaaacat ttggcaataa agtttcttaa gattgaatcc tgttgccggt 4980 

cttgcgatga ttatcatata atttctgttg aattacgtta agcatgtaat aattaacatg 5040 

taatgcatga cgttatttat gagatgggtt tttatgatta gagtcccgca attatacatt 5100 

taatacgcga tagaaaacaa aatatagcgc gcaaactagg ataaattatc gcgcgcggtg 5160 

tcatctatgt tactagatcg 5180 

<210> 4 

<211> 5180 

<212> DNA 

<213> SYNTHETIC - 12424 



<400> 4 

gttaatcatg gtgtaggcaa cccaaataaa acaccaaaat atgcacaagg cagtttgttg 60 

tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt agaaaaggaa 120 

acaatatcat gagtaatgtg tgagcattat gggaccacga aataaaaaga acattttgat 18 0 

gagtcgtgta tcctcgatga gcctcaaaag ttctctcacc ccggataaga aacccttaag 24 0 

caatgtgcaa agtttgcatt ctccactgac ataatgcaaa ataagatatc atcgatgaca 300 

tagcaactca tgcatcatat catgcctctc tcaacctatt cattcctact catctacata 360 

agtatcttca gctaaatgtt agaacataaa cccataagtc acgtttgatg agtattaggc 420 

gtgacacatg acaaatcaca gactcaagca agataaagca aaatgatgtg tacataaaac 480 

tccagagcta tatgtcatat tgcaaaaaga ggagagctta taagacaagg catgactcac 540 

aaaaattcat ttgcctttcg tgtcaaaaag aggagggctt tacattatcc atgtcatatt 600 

gcaaaagaaa gagagaaaga acaacacaat gctgcgtcaa ttatacatat ctgtatgtcc 660 

atcattattc atccaccttt cgtgtaccac acttcatata tcatgagtca cttcatgtct 720 

ggacattaac aaactctatc ttaacattta gatgcaagag cctttatctc actataaatg 780 

cacgatgatt tctcattgtt tctcacaaaa agcattcagt tcattagtcc tacaacaacg 840 

aattcggctt cccaaatcgc cgccaccatg gcttctatga tatcctcttc cgctgtgaca 900 

acagtcagcc gtgcctctag ggggcaatcc gccgcagtgg ctccattcgg cggcctcaaa 960 
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tccatgactg gattcccagt gaagaaggtc aacactgaca ttacttccat tacaagcaat 1020 
ggtggaagag taaagtgcat gaaaccaact acggtaattg gtgcaggctt cggtggcctg 1080 
gcactggcaa ttcgtctaca agctgcgggg atccccgtct tactgcttga acaacgtgat 1140 
aaacccggcg gtcgggctta tgtctacgag gatcaggggt ttacctttga tgcaggcccg 1200 
acggttatca ccgatcccag tgccattgaa gaactgtttg cactggcagg aaaacagtta 12 60 
aaagagtatg tcgaactgct gccggttacg ccgttttacc gcctgtgttg ggagtcaggg 1320 
aaggtcttta attacgataa cgatcaaacc cggctcgaag cgcagattca gcagtttaat 1380 
ccccgcgatg tcgaaggtta tcgtcagttt ctggactatt cacgcgcggt gtttaaagaa 1440 
ggctatctga agctcggtac tgtccctttt ttatcgttca gagacatgct tcgcgccgca 1500 

cctcaactgg cgaaactgca ggcatggaga agcgtttaca gtaaggttgc cagttacatc 1560 

gaagatgaac atctgcgcca ggcgttttct ttccactcgc tgttggtggg cggcaatccc 1620 

ttcgccacct catccattta tacgttgata cacgcgctgg agcgtgagtg gggcgtctgg 1680 

tttccgcgtg gcggcaccgg cgcattagtt caggggatga taaagctgtt tcaggatctg 1740 

ggtggcgaag tcgtgttaaa cgccagagtc agccatatgg aaacgacagg aaacaagatt 1800 

gaagccgtgc atttagagga cggtcgcagg ttcctgacgc aagccgtcgc gtcaaatgca 1860 

gatgtggttc atacctatcg cgacctgtta agccagcacc ctgccgcggt taagcagtcc 1920 

aacaaactgc agactaagcg catgagtaac tctctgtttg tgctctattt tggtttgaat 1980 

caccatcatg atcagctcgc gcatcacacg gtttgtttcg gcccgcgtta ccgcgagctg 2040 

attgacgaaa tttttaatca tgatggcctc gcagaggact tctcacttta tctgcacgcg 210 0 

ccctgtgtca cggattcgtc actggcgcct gaaggttgcg gcagttacta tgtgttggcg 2160 

ccggtgccgc atttaggcac cgcgaacctc gactggacgg ttgaggggcc aaaactacgc 2220 

gaccgtattt ttgcgtacct tgagcagcat tacatgcctg gcttacggag tcagctggtc 2280 

acgcaccgga tgtttacgcc gtttgatttt cgcgaccagc ttaatgccta tcatggctca 2340 

gccttttctg tggagcccgt tcttacccag agcgcctggt ttcggccgca taaccgcgat 2400 

aaaaccatta ctaatctcta cctggtcggc gcaggcacgc atcccggcgc aggcattcct 2460 

ggcgtcatcg gctcggcaaa agcgacagca ggtttgatgc tggaggatct gatttgaggc 252 0 

catgcaggcc gatccccgat cgttcaaaca tttggcaata aagtttctta agattgaatc 2580 

ctgttgccgg tcttgcgatg attatcatat aatttctgtt gaattacgtt aagcatgtaa 2640 

taattaacat gtaatgcatg acgttattta tgagatgggt ttttatgatt agagtcccgc 2700 

aattatacat ttaatacgcg atagaaaaca aaatatagcg cgcaaactag gataaattat 2760 
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cgcgcgcggt gtcatctatg 


[ ttactagatc 


: gggccttaat 


aagcttgtta 


atcatggtgt 


2820 


aggcaaccca aataaaacac 


caaaatatgc 


acaaggcagt 


ttgttgtatt 


ctgtagtaca 


2880 


gacaaaacta aaagtaatga 


aagaagatgt 


ggtgttagaa 


aaggaaacaa 


tatcatgagt 


2940 


aatgtgtgag cattatggga 


ccacgaaata 


aaaagaacat 


tttgatgagt 


cgtgtatcct 


3000 


cgatgagcct caaaagttct 


ctcaccccgg 


ataagaaacc 


cfctaagcaat 


gtgcaaagtt 


3060 


tgcattctcc actgacataa 


tgcaaaataa 


gatatcatcg 


atgacatagc 


aactcatgca 


3120 


tcatatcatg cctctctcaa 


cctattcatt 


cctactcatc 


tacataagta 


tcttcagcta 


3180 


aatgttagaa cataaaccca 


taagtcacgt 


ttgatgagta 


ttaggcgtga 


cacatgacaa 


3240 


atcacagact caagcaagat 


aaagcaaaat 


gatgtgtaca 


taaaactcca 


gagctatatg 


3300 


tcatattgca aaaagaggag agcttataag acaaggcatg 


actcacaaaa 


attcatttgc 


3360 


ctttcgtgtc aaaaagagga 


gggctttaca 


ttatccatgt 


catattgcaa 


aagaaagaga 


3420 


gaaagaacaa cacaatgctg 


cgtcaattat 


acatatctgt 


atgtccatca 


ttattcatcc 


•3480 


acctttcgtg taccacactt 


catatatcat 


gagtcacttc 


atgtctggac 


attaacaaac 


3540 


tctatcttaa catttagatg 


caagagcctt 


tatctcacta 


taaatgcacg 


atgatttctc 


3600 


attgtttctc acaaaaagca 


ttcagttcat 


tagtcctaca 


acaacgaatt 


cggcttccca 


3660 


aatcgccgcc accatggcca 


tcatactcgt 


acgagcagcg 


tcgccggggc 


tctccgccgc 


3720 


cgacagcatc agccaccagg ggactctcca gtgctccacc ctgctcaaga 


cgaagaggcc 


3780 


ggcggcgcgg cggtggatgc 


cctgctcgct 


ccttggcctc 


cacccgfcggg 


aggctggccg 


3840 


tccctccccc gccgtctact 


ccagcctgcc 


cgtcaacccg 


gcgggagagg 


ccgtcgtctc 


3900 


gtccgagcag aaggtctacg 


acgtcgtgct 


caagcaggcc 


gcattgctca 


aacgccagct 


3960 


gcgcacgccg gtcctcgacg 


ccaggcccca 


ggacatggac 


atgccacgca 


acgggctcaa 


4020 


ggaagcctac gaccgctgcg gcgagatctg 


tgaggagfcat 


gccaagacgt 


tttacctcgg 


4080 


aactatgttg atgacagagg 


agcggcgccg 


cgccatatgg 


gccatctatg 




4140 


gaggacagat gagcttgtag 


atgggccaaa 


cgccaactac 


attacaccaa 


cagctttgga 


4200 


ccggtgggag aagagacttg 


aggatctgtt 


cacgggacgt 


ccttacgaca 


tgcttgatgc 


4260 


cgctctctct gataccatct 


caaggttccc 


catagacatt 


cagccattca 


gggacatgat 


4320 


tgaagggatg aggagtgatc 


ttaggaagac 


aaggtataac 


aacttcgacg 


agctctacat 


4380 


gtactgctac tatgttgctg 


gaactgtcgg 


gttaatgagc 


gtacctgtga 


tgggcatcgc 


4440 


aaccgagtct aaagcaacaa 


ctgaaagcgt 


atacagtgct 


gccttggctc 


tgggaattgc 


4500 


gaaccaactc acgaacatac 


tccgggatgt 


tggagaggat 


gctagaagag 


gaaggatata 


4560 
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tttaccacaa gatgagcttg cacaggcagg gctctctgat 
cgtcacgaac cggtggagaa acttcatgaa gaggcagatc 
tgaggaggca gagagagggg taactgagct ctcacaggct 
ttccctgttg ttgtacaggc agatcctgga tgagatcgaa 
cacgaagagg gcgtatgttg gtaaagggaa gaagttgcta 
aaaatcgcta ctgctcccat gttcattgag aaatggccag 
atccccgatc gttcaaacat ttggcaataa agtttcttaa 
cttgcgatga ttatcatata atttctgttg aattacgtta 
taatgcatga cgttatttat gagatgggtt tttatgatta 
taatacgcga tagaaaacaa aatatagcgc gcaaactagg 
tcatctatgt tactagatcg 



gaggacatct tcaaaggggt 4 620 

aagagggcca ggatgttttt 4 680 

agcagatggc cagtatgggc 4740 

gccaacgact acaacaactt 4800 

gcacttcctg tggcatatgg 4860 

acctagggcc atgcaggccg 4920 

gattgaatcc tgttgccggt 4980 

agcatgtaat aattaacatg 5 040 

gagtcccgca attatacatt 5100 

ataaattatc gcgcgcggtg 5160 

5180 



<210> 5 

<211> 5653 

<212> DNA 

<213> SYNTHETIC 



<400> 5 



gttaatcatg 


gtgtaggcaa 


cccaaataaa acaccaaaat atgcacaagg 


cagtttgttg 


60 


tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt 


agaaaaggaa 


120 


acaatatcat 


gagtaatgtg 


tgagcattat gggaccacga aataaaaaga 


acattttgat 


180 


gagtcgtgta 


tcctcgatga gcctcaaaag ttctctcacc ccggataaga aacccttaag 


240 


caatgtgcaa 


agtttgcatt 


ctccactgac ataatgcaaa ataagatatc 


atcgatgaca 


300 


tagcaactca 


tgcatcatat 


catgcctctc tcaacctatt cattcctact 


catctacata 


360 


agtatcttca gctaaatgtt 


agaacataaa cccataagtc acgtttgatg 


agtattaggc 


420 


gtgacacatg 


acaaatcaca 


gactcaagca agataaagca aaatgatgtg 


tacataaaac 


480 


tccagagcta 


tatgtcatat 


tgcaaaaaga ggagagctta taagacaagg 


catgactcac 


540 


aaaaattcat 


ttgcctttcg 


tgtcaaaaag aggagggctt tacattatcc 


atgtcatatt 


600 


gcaaaagaaa 


gagagaaaga 


acaacacaat gctgcgtcaa ttatacatat 


ctgtatgtcc 


660 


atcattattc 


atccaccttt 


cgtgtaccac acttcatata tcatgagtca 


cttcatgtct 


720 


ggacattaac 


aaactctatc 


ttaacattta gatgcaagag cctttatctc 


actataaatg 


780 
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cacgatgatt 


. tctcattgtt 


tctcacaaaa 


agcattcagt 


tcattagtcc 


tacaacaacg 


840 


aattcggctt 


cccgggtaca 


gggtaaattt 


ctagtttttc 


tccttcattt 


tcttggttag 


900 


gacccttttc 


tctttttatt 


tttttgagct 


ttgatctttc 


tttaaactga 


tctatttttt 


960 


aattgattgg 


ttatcgtgta 


aatattacat 


agctttaact 


gataatctga 


ttactttatt 


1020 


tcgtgtgtct 


ttgatcatct 


tgatagttac 


agaaccgtcg 


actctagaga 


agccatttaa 


1080 


atcgccgcca 


ccatggcttc 


tatgatatcc 


tcttccgctg 


tgacaacagt 


cagccgtgcc 


1140 


tctagggggc 


aatccgccgc 


agtggctcca 


ttcggcggcc 


tcaaatccat 


gactggattc 


1200 


ccagtgaaga 


aggtcaacac 


tgacattact 


tccattacaa 


gcaatggtgg 


aagagtaaag 


1260 


tgcatggcgg 


ccgccaaacc 


aactacggta 


attggtgcag 


gcttcggtgg 


cctggcactg 


1320 


gcaattcgtc 


tacaagctgc 


ggggatcccc 


gtcttactgc 


ttgaacaacg 


tgataaaccc 


1380 


ggcggtcggg 


cttatgtcta 


cgaggatcag 


gggtttacct 


ttgatgcagg 


cccgacggtt 


1440 


atcaccgatc 


ccagtgccat 


tgaagaactg 


tttgcactgg 


caggaaaaca 


gttaaaagag 


1500 


tatgtcgaac 


tgctgccggt 


tacgccgttt 


taccgcctgt 


gttgggagtc 


agggaaggtc 


1560 


tttaattacg 


ataacgatca 


aacccggctc 


gaagcgcaga 


ttcagcagtt 


taatccccgc 


1620 


gatgtcgaag gttatcgtca gfcttctggac tattcacgcg cggtgtttaa agaaggctat 


1680 


ctgaagctcg 


gtactgtccc 


ttttttatcg 


ttcagagaca 


tgcttcgcgc 


cgcaccfccaa 


1740 


ctggcgaaac 


tgcaggcatg 


gagaagcgtt 


tacagtaagg 


ttgccagtta 


catcgaagat 


1800 


gaacatctgc 


gccaggcgtt 


ttctttccac 


tcgctgttgg 


tgggcggcaa 


tcccttcgcc 


1860 


acctcatcca 


tttatacgtt 


gatacacgcg 


ctggagcgtg 


agtggggcgt 


ctggtttccg 


1920 


cgtggcggca 


ccggcgcatt 


agttcagggg 


atgataaagc 


tgtttcagga 


tctgggtggc 


1980 


gaagtcgtgt 


taaacgccag agtcagccat 


atggaaacga 


caggaaacaa 


gattgaagcc 


2040 


gtgcatttag 


aggacggtcg 


caggttcctg 


acgcaagccg 


tcgcgtcaaa 


tgcagatgtg 


2100 


gttcatacct 


atcgcgacct 


gttaagccag 


caccctgccg 


cggttaagca 


gtccaacaaa 


2160 


ctgcagacta 


agcgcatgag 


taactctctg 


tttgtgctct 


attttggttt 


gaatcaccat 


2220 


catgatcagc 


tcgcgcatca 


cacggtttgt 


ttcggcccgc 


gttaccgcga 


gctgattgac 


2280 


gaaattttta 


atcatgatgg 


cctcgcagag 


gacttctcac 


Cttatctgca 


cgcgccctgt 


2340 


gtcacggatt 


cgtcactggc 


g'cctgaaggt 


tgcggcagtt 


actatgtgtt 


ggcgccggtg 


2400 


ccgcat ttag 


gcaccgcgaa 


cctcgactgg 


acggttgagg 


ggccaaaact 


acgcgaccgt 


2460 


atttttgcgt 


accttgagca 


gcattacatg 


cctggcttac 


ggagtcagct 


ggtcacgcac 


2520 


cggatgttta 


cgccgtttga 


ttttcgcgac 


cagcttaatg 


cctatcatgg 


ctcagccttt 


2580 
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tctgtggagc ccgttcttac ccagagcgcc tggtttcggc cgcataaccg cgataaaacc 2640 

attactaatc tctacctggt cggcgcaggc acgcatcccg gcgcaggcat tcctggcgtc 2700 

atcggctcgg caaaagcgac agcaggtttg atgctggagg atctgatttg aggtacctcg 2760 

acggccatgc aggccgatcc ccgatcgttc aaacatttgg caataaagtt tcttaagatt 2820 

gaatcctgtt gccggtcttg cgatgattat catataattt ctgttgaatt acgttaagca 2880 

tgtaataatt aacatgtaat gcatgacgtt atttatgaga tgggttttta tgattagagt 2940 

cccgcaatta tacatttaat acgcgataga aaacaaaata tagcgcgcaa actaggataa 3 000 

attatcgcgc gcggtgtcat ctatgttact agatcgggcc ttaatcgcaa gcttgttaat 3060 

catggtgtag gcaacccaaa taaaacacca aaatatgcac aaggcagttt gttgtattct 3120 

gtagtacaga caaaactaaa agtaatgaaa gaagatgtgg tgttagaaaa ggaaacaata 3180 

tcatgagtaa tgtgtgagca ttatgggacc acgaaataaa aagaacattt tgatgagtcg 3240 

tgtatcctcg atgagcctca aaagttctct caccccggat aagaaaccct taagcaatgt 33 00 

gcaaagtttg cattctccac tgacataatg caaaataaga tatcatcgat gacatagcaa 3360 

ctcatgcatc atatcatgcc tctctcaacc tattcattcc tactcatcta cataagtatc 3420 

ttcagctaaa tgttagaaca taaacccata agtcacgttt gatgagtatt aggcgtgaca 34 80 

catgacaaat cacagactca agcaagataa agcaaaatga tgtgtacata aaactccaga 3540 

gctatatgtc atattgcaaa aagaggagag cttataagac aaggcatgac tcacaaaaat 3 600 

tcatttgcct ttcgtgtcaa aaagaggagg gctttacatt atccatgtca tattgcaaaa 3660 

gaaagagaga aagaacaaca caatgctgcg tcaattatac atatctgtat gtccatcatt 3720 

attcatccac ctttcgtgta ccacacttca tatatcatga gtcacttcat gtctggacat 3780 

taacaaactc tatcttaaca tttagatgca agagccttta tctcactata aatgcacgat 3840 

gatttctcat tgtttctcac aaaaagcatt cagttcatta gtcctacaac aacgaattcg 3900 

gcttcccggg tacagggtaa atttctagtt tttctccttc attttcttgg ttaggaccct 3960 

tttctctttt tatttttttg agctttgatc tttctttaaa ctgatctatt ttttaattga 4020 

ttggttatcg tgtaaatatt acatagcttt aactgataat ctgattactt tatttcgtgt 4080 

gtctttgatc atcttgatag ttacagaacc gtcgactcta gagaagccat ttaaatcgcc 4140 

gccaccatgg ccatcatact cgtacgagca gcgtcgccgg ggctctccgc cgccgacagc 4200 

atcagccacc aggggactct ccagtgctcc accctgctca agacgaagag gccggcggcg 42 60 

c gg c 99tgga tgccctgctc gctccttggc ctccacccgt gggaggctgg ccgtccctcc 4320 

cccgccgtct actccagcct gcccgtcaac ccggcgggag aggccgtcgt ctcgtccgag 4380 
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cagaaggtct 


acgacgtcgt 


gctcaagcag 


gccgcattgc 


tcaaacgcca 


gctgcgcacg 


4440 


ccggtcctcg 


acgccaggcc 


ccaggacatg 


gacatgccac 


geaaeggget 


caaggaagee 


4 5 00 


tacgaccgct 


gcggcgagat 


ctgtgaggag 


tatgecaaga 


cgttttacct 


eggaactatg 


4560 


ttgatgacag 


aggagcggcg 


ccgcgccata 


tgggecatet 


atgtgtggtg 


taggaggaca 


4620 


gatgagcttg 


tagatgggcc 


aaacgccaac 


tacattacac 


caacagcttt 


ggaccggtgg 


4680 


gagaagagac 


ttgaggatct 


gttcacggga 


cgtccttacg 


acatgettga 


tgccgctctc 


4740 


tctgatacca 


tctcaaggtt 


ccccatagac 


attcagecat 


tcagggacat 


gattgaaggg 


4800 


atgaggagtg 


atcttaggaa 


gacaaggtat 


aacaacttcg 


acgagctcta 


catgtactgc 


4860 


tactatgttg 


ctggaactgt 


cgggttaatg 


agegtacctg 


tgatgggcat 


cgcaaccgag 


4920 


tctaaagcaa 


caactgaaag 


cgtatacagt 


getgecttgg 


ctctgggaat 


tgcgaaccaa 


4980 


ctcacgaaca 


tactccggga 


tgttggagag 


gatgetagaa 


gaggaaggat 


atatttacca 


5040 


caagatgagc 


ttgcacaggc 


agggctctct 


gatgaggaca 


tcttcaaagg 


ggtcgtcacg 


5100 


aaecggtgga 


gaaacttcat 


gaagaggcag 


atcaagaggg 


ccaggatgtt 


ttttgaggag 


5160 


gcagagagag gggtaaatga 


gctctcacag 


gctagcagat ggccagtatg 


ggcttccctg 


5220 


ttgttgtaca 


ggcagatcct 


ggatgagatc 


gaagccaacg 


actacaacaa 


cttcacgaag 


5280 


agggcgtatg 


ttggtaaagg 


gaagaagttg 


ctagcacttc 


ctgtggcata 


tggaaaatcg 


5340 


ctactgctcc 


catgttcatt 




cagacctagg gecatgeagg 


ccaahcccca 


54 00 


atcgttcaaa 


catttggcaa 


taaagtttct 


taagattgaa 


tcctgttgcc 


ggtcttgega 


5460 


tgattatcat 


ataatttctg 


ttgaattacg 


ttaagcatgt 


aataattaac 


atgtaatgca 


5520 


tgacgttatt 


tatgagatgg 


gtttttatga 


ttagagtccc 


gcaattatac 


atttaatacg 


5580 


cgatagaaaa 


caaaatatag 


cgcgcaaact 


aggataaatt 


atcgcgcgcg 


gtgtcatcta 


5640 


tgttactaga 


teg 










5653 



<210> 6 

<211> 5714 

<212> DNA 

<213> SYNTHETIC - 11586 



<400> 6 

gttaatcatg gtgtaggcaa cccaaataaa acaccaaaat atgeacaagg cagtttgttg 



tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt agaaaaggaa 



60 
120 
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acaatatcat gagtaatgtg tgagcattat gggaccacga aataaaaaga acattttgat 180 
gagtcgtgta tcctcgatga gcctcaaaag ttctctcacc ccggataaga aacccttaag 240 
caatgtgcaa agtttgcatt ctccactgac ataatgcaaa ataagatatc atcgatgaca 3 00 
tagcaactca tgcatcatat catgcctctc tcaacctatt cattcctact .catctacata 350 

agtatcttca gctaaatgtt agaacataaa cccataagtc acgtttgatg agtattaggc 420 

gtgacacatg acaaatcaca gactcaagca agataaagca aaatgatgtg tacataaaac 480 

tccagagcta tatgtcatat tgcaaaaaga ggagagctta taagacaagg catgactcac 54 0 

aaaaattcat ttgcctttcg tgtcaaaaag aggagggctt tacattatcc atgtcatatt 600 

gcaaaagaaa gagagaaaga acaacacaat gctgcgtcaa ttatacatat ctgtatgtcc 660 

atcattattc atccaccttt cgtgtaccac acttcatata tcatgagtca cttcatgtct 720 ' 

ggacattaac aaactctatc ttaacattta gatgcaagag cctttatctc actataaatg 780 

cacgatgatt tctcattgtt tctcacaaaa agcattcagt tcattagtcc tacaacaacg 84 0 

aattcggctt cccgggtaca gggtaaattt ctagtttttc tccttcattt tcttggttag 900 

gacccttttc tctttttatt tttttgagct ttgatctttc tttaaactga tctatttttt 960 

aattgattgg ttatcgtgta aatattacat agctttaact gataatctga ttactttatt 1020 

tcgtgtgtct ttgatcatct tgatagttac agaaccgtcg actctagaga agccatttaa 1080 

atcgccgcca ccatggcttc tatgatatcc tcttccgctg tgacaacagt cagccgtgcc 1140 

tctagggggc aatccgccgc agtggctcca ttcggcggcc tcaaatccat gactggattc 1200 

ccagtgaaga aggtcaacac tgacattact tccattacaa gcaatggtgg aagagtaaag 1260 

tgcatggcgg ccgccaaacc aactacggta attggtgcag gcttcggtgg cctggcactg 1320 

gcaattcgtc tacaagctgc ggggatcccc gtcttactgc ttgaacaacg tgataaaccc 1380 

ggcggtcgggcttatgtcta cgaggatcag gggtttacct ttgatgcagg cccgacggtt 1440 

atcaccgatc ccagtgccat tgaagaactg tttgcactgg caggaaaaca gttaaaagag 1500 

tatgtcgaac tgctgccggt tacgccgttt taccgcctgt gttgggagtc agggaaggtc 1560 

tttaattacg ataacgatca aacccggctc gaagcgcaga ttcagcagtt taatccccgc 1620 

gatgtcgaag gttatcgfcca gtttctggac tattcacgcg cggtgtttaa agaaggctat 1680 

ctgaagctcg gtactgtccc ttttttatcg ttcagagaca tgcttcgcgc cgcacctcaa 1740 

ctggcgaaac tgcaggcatg gagaagcgtt tacagtaagg ttgccagtta catcgaagat 1800 

gaacatctgc gccaggcgtt ttctttccac tcgctgttgg tgggcggcaa tcccttcgcc i860 

acctcatcca tttatacgtt gatacacgcg ctggagcgtg agtggggcgt ctggtttccg 1920 
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cgtggcggca ccggcgcatt agttcagggg atgataaagc tgtttcagga tctgggtggc 1980 

gaagtcgtgt taaacgccag agtcagccat atggaaacga caggaaacaa gattgaagcc 2040 

gtgcatttag aggacggtcg caggttcctg acgcaagccg tcgcgtcaaa tgcagatgtg 2100 

gttcatacct atcgcgacct gttaagccag caccctgccg cggttaagca gtccaacaaa 2160 

ctgcagacta agcgcatgag taactctctg tttgtgctct attttggttt gaatcaccat 2220 

catgatcagc tcgcgcatca cacggtttgt ttcggcccgc gttaccgcga gctgattgac 2280 

gaaattttta atcatgatgg cctcgcagag gacttctcac tttatctgca cgcgccctgt 2340 

gtcacggatt cgtcactggc gcctgaaggt tgcggcagtt actatgtgtt ggcgccggtg 2400 

ccgcatttag gcaccgcgaa cctcgactgg acggttgagg ggccaaaact acgcgaccgt 2460 

atttttgcgt accttgagca gcattacatg ' cctggcttac ggagtcagct ggtcacgcac 2520 

cggatgttta cgccgtttga ttttcgcgac cagcttaatg cctatcatgg ctcagccttt 2580 

tctgtggagc ccgttcttac ccagagcgcc tggtttcggc cgcataaccg cgataaaacc 2 640 

attactaatc tctacctggt cggcgcaggc acgcatcccg gcgcaggcat tcctggcgtc 2700 

atcggctcgg caaaagcgac agcaggtttg atgctggagg atctgatttg aggtacctcg 2 7 60 

acggccatgc aggccgatcc ccgatcgttc aaacatttgg caataaagtt tcttaagatt 2820 

gaafccctgtt gccggtctfcg cgatgattat catataattt ctgttgaatt acgttaagca 2880 

tgtaataatt aacatgtaat gcatgacgtt atttatgaga tgggttttta tgattagagt - 2 94 0. 

cccgcaatta tacatttaat acgcgataga aaacaaaata tagcgcgcaa actaggataa 3000 

attatcgcgc gcggtgtcat ctatgttact agatcgggcc ttaaaactga aggcgggaaa 3 060 

cgacaatctg atcbctagga agcttgttaa tcatggtgta ggcaacccaa ataaaacacc 3120 

aaaatatgca caaggcagtt tgttgtattc tgtagtacag acaaaactaa aagtaatgaa 3180 

agaagatgtg gtgttagaaa aggaaacaat atcatgagta atgtgtgagc attatgggac * 3240 

cacgaaataa aaagaacatt ttgatgagtc gtgtatcctc gatgagcctc aaaagttctc 3 3 00 

tcaccccgga taagaaaccc ttaagcaatg tgcaaagttt gcattctcca ctgacataat 33 60 

gcaaaataag atatcatcga tgacatagca actcatgcat catatcatgc ctctctcaac 3420 

ctattcattc ctactcatct acataagtat cttcagctaa atgttagaac ataaacccat 3480 

aagtcacgtt tgatgagtat taggcgtgac acatgacaaa tcacagactc aagcaagata 3540 

aagcaaaatg atgtgtacat aaaactccag agctatatgt catattgcaa aaagaggaga 3600 

gcttataaga caaggcatga ctcacaaaaa ttcatttgcc tttcgtgtca aaaagaggag 3660 

ggctttacat tatccatgtc atattgcaaa agaaagagag aaagaacaac acaatgctgc 3720 
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gtcaattata catatctgta tgtccatcat tattcatcca cctttcgtgt accacacttc 3780 
atatatcatg agtcacttca tgtcfcggaca ttaacaaact ctatcttaac atttagatgc 3840 
aagagccttt atctcactat aaatgcacga tgatttctca ttgtttctca caaaaagcat 3900 
tcagttcatt agtcctacaa caacgaattc ggcttcccgg gtacagggta aatttctagt 3960 
ttttctcctt cattttcttg gttaggaccc ttttctcttt ttattttttt gagctttgat 4020 
ctttctttaa actgatctat tttttaattg attggttatc gtgtaaatat tacatagctt 4080 
taactgataa tctgattact ttatttcgtg tgtctttgat catcttgata gttacagaac 4140 
cgtcgactct agagaageca tttaaatcgc cgccaccatg gcggccatca cgctcctacg 4200 
ttcagcgtct cttccgggcc tctccgacgc cctcgcccgg gacgctgctg ccgtccaaca 4260 
tgtctgctcc tcctacctgc ccaacaacaa ggagaagaag aggaggtgga tcctctgctc 4320 
gctcaagtac gcctgccttg gcgtcgaccc tgccccgggc gagattgccc ggacctcgcc 4380 
ggtgtactcc agcctcaccg tcacccctgc tggagaggcc gtcatctcct cggagcagaa 4440 
ggtgtacgac gtcgtcctca agcaggcagc attgctcaaa cgccacctgc gcccacaacc 4500 
acacaccatt cccatcgttc ccaaggacct ggacctgcca agaaacggcc tcaagcaggc 4560 
ctatcatcgc tgcggagaga tctgcgagga gtatgccaag accttttacc ttggaactat 4620 
gctcatgacg gaggaccgac ggcgcgccat atgggccatc tatgtgtggt gtaggaggac 4680 
agatgagctt gtagatggac caaatgcctc gcacatcaca ccgtcagccc tggaccggt.g 4740 
ggagaagagg cttgatgatc tcttcaccgg acgcccctac gacatgcttg atgctgcact 4 800 
ttctgatacc atctccaagt ttcctataga tattcagcct ttcagggaca tgatagaagg 4 860 
gatgcggtca gacctcagaa agactagata caagaacttc gacgagctct acatgtactg 4920 ■ 
ctactatgtt gctggaactg tggggctaat gagtgttcct gtgatgggta ttgcacccga 4980 
gtcgaaggca acaactgaaa gtgtgtacag tgctgctttg gctctcggca ttgcaaacca 504 0 
gctcacaaat atactccgtg acgttggaga ggacgcgaga agagggagga tatatttacc 5100 
acaagatgaa cttgcagagg cagggctctc tgatgaggac atcttcaatg gcgttgtgac 5160 
taacaaatgg agaagcttca tgaagagaca gatcaagaga gctaggatgt tttttgagga 5220 
ggcagagaga ggggtgaccg agctcagcca ggcaagccgg tggccggtct gggcgtctct 5280 
gttgttatac cggcaaatcc ttgacgagat agaagcaaac gattacaaca acttcacaaa 5340 
gagggcgtac gtfcgggaagg cgaagaaatt gctagcgctt ccagttgcat atggtagatc 5400 
attgctgatg ccctactcac tgagaaatag ccagaagtag ggccatgcag gccgatcccc 5460 
gatcgttcaa acatttggca ataaagtttc ttaagattga atcctgttgc cggtcttgcg 5520 
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atgattatca tataatttct gttgaattac gttaagcatg taataattaa catgtaatgc 5580 

atgacgttat ttatgagatg ggtttttatg attagagtcc cgcaattata catttaatac 5640 

gcgatagaaa acaaaatata gcgcgcaaac taggataaat tatcgcgcgc ggtgtcatct 5700 

atgttactag atcg 5714 

<210> 7 

<211> 5974 

<212> D^7A 

<213> SYNTHETIC - 7 651 



<400> 7 

gttaatcatg gtgtaggcaa cccaaataaa acaccaaaat atgcacaagg cagtttgttg 60 

tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt agaaaaggaa '120 

acaatatcat gagtaatgtg tgagcattat gggaccacga aataaaaaga acattttgat 18 0 

gagtcgtgta tcctcgatga gcctcaaaag ttctctcacc ccggataaga aacccttaag 240 

caatgtgcaa agtttgcatt ctccactgac ataatgcaaa ataagatatc atcgatgaca 300 

tagcaactca tgcatcatat catgcctctc tcaacctatt cattcctact catctacata 360 

agtatcttca gctaaatgtt agaacataaa cccataagtc acgtttgatg agtattaggc 420 

gtgacacatg acaaatcaca gactcaagca agataaagca aaatg.atgtg tacataaaac 4 80 

tccagagcta tatgtcatat tgcaaaaaga ggagagcfcta taagacaagg catgactcac 54 0 

aaaaattcat ttgcctttcg tgtcaaaaag aggagggctt tacattatcc atgtcatatt 600 

gcaaaagaaa gagagaaaga acaacacaat gctgcgtcaa ttatacatat ctgtatgtcc 660 

atcattattc atccaccttt cgtgtaccac acttcatata tcatgagtca cttcatgtct 720 

ggacattaac aaactctatc ttaacattta gatgcaagag cctttatctc actataaatg 780 

cacgatgatt tctcattgtt tctcacaaaa agcattcagt tcattagtcc tacaacaacg 840 

aattcggctt cccgggtaca gggtaaattt ctagtttttc tccttcattt tcttggttag 900 

gacccttttc tctttttatt tttttgagct ttgatctttc tttaaactga tctatttttt 960 

aattgattgg ttatcgtgta aatattacat agctttaact gataatctga ttactttatt 1020 

tcgtgtgtct ttgatcatct tgafcagttac agaaccgtcg actctagaga agccatttaa 1080 

atcgccgcca ccatggcttc tatgatatcc tcttccgctg tgacaacagt cagccgtgcc 1140 

tctagggggc aatccgccgc agtggctcca ttcggcggcc tcaaatccat gactggattc 1200 
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ccagtgaaga aggtcaacac tgacattact tccattacaa gcaatggtgg aagagtaaag 1260 
tgcatggcgg ccgccaaacc aactacggta attggtgcag gcttcggtgg cctggcactg 1320 
gcaattcgtc tacaagctgc ggggatcccc gtcttactgc ttgaacaacg tgataaaccc 1380 
ggcggtcggg cttatgtcta cgaggatcag gggtttacct ttgatgcagg cccgacggtt 1440 
atcaccgatc ccagtgccat tgaagaactg tttgcactgg caggaaaaca gttaaaagag 1500 
tatgtcgaac tgctgccggt tacgccgttt taccgcctgt gttgggagtc agggaaggtc 1560 
tttaattacg ataacgatca aacccggctc gaagcgcaga ttcagcagtt taatccccgc 162 0 
gatgtcgaag gttatcgtca gtttctggac tattcacgcg cggtgtttaa agaaggctat 1680 
ctgaagctcg gtactgtccc ttttttatcg ttcagagaca tgcttcgcgc cgcacctcaa 1740 
ctggcgaaac tgcaggcatg gagaagcgtt tacagtaagg ttgccagtta catcgaagat 1800 
gaacatctgc gccaggcgtt ttctttccac tcgctgttgg tgggcggcaa tcccttcgcc 1860 
acctcatcca tttatacgtt gatacacgcg ctggagcgtg agtggggcgt ctggtttccg 1920 
cgtggcggca ccggcgcatt agttcagggg atgataaagc tgtttcagga tctgggtggc 1980 
gaagtcgtgt taaacgccag agtcagccat atggaaacga caggaaacaa gattgaagcc 2040 
gtgcatttag aggacggtcg caggttcctg acgcaagccg tcgcgtcaaa tgcagatgtg 2100 
gttcatacct atcgcgacct gttaagccag caccctgccg cggttaagca gtccaacaaa 2160 
ctgcagacta agcgcatgag taactctctg tttgtgctct attttggttt gaatcaccat 2220 
catgatcagc tcgcgcatca cacggtttgt ttcggcccgc gttaccgcga gctgattgac 2280 
gaaattttta atcatgatgg cctcgcagag gacttctcac tttatctgca cgcgccctgt 2340 
gtcacggatt cgtcactggc gcctgaaggt tgcggcagtt actatgtgtt ggcgccggtg 2400 
ccgcatttag gcaccgcgaa cctcgactgg acggttgagg ggccaaaact acgcgaccgt 2460 
atttttgcgt accttgagca gcattacatg cctggcttac ggagtcagct ggtcacgcac 2520 
cggatgttta cgccgtttga ttttcgcgac cagcttaatg cctatcatgg ctcagccttt 2580 
tctgtggagc ccgttcttac ccagagcgcc tggtttcggc cgcataaccg cgataaaacc 2640 
attactaatc tctacctggt cggcgcaggc acgcatcccg gcgcaggcat tcctggcgtc 2700 
atcggctcgg caaaagcgac agcaggtttg atgctggagg atctgatttg aggtacctcg 2760 
acggccatgc aggccgatcc ccgatcgttc aaacatttgg caataaagtt tcttaagatt 2820 
gaatcctgtt gccggtcttg cgatgattat catataattt ctgttgaatt acgttaagca 2880 
tgtaataatt aacatgtaat gcatgacgtt atttatgaga tgggttttta tgattagagt 2940 
cccgcaatta tacatttaat acgcgataga aaacaaaata tagcgcgcaa actaggataa 3000 
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attatcgcgc 


gcggtgtcat 


ctatgttact 


agatcgggcc 


ttaatgttcg 


gggcgaacat 


3060 


cgcaagcttg ttaatcatgg tgtaggcaac 


ccaaataaaa 


caccaaaata 


tgcacaaggc 


3120 


agtttgttgt attctgtagt acagacaaaa 


ctaaaagtaa 


tgaaagaaga 


tgtggtgtta 


3180 


gaaaaggaaa 


caatatcatg 


agtaatgtgt 


gagcattatg 


ggaccacgaa 


ataaaaagaa 


3240 


cattttgatg 


agtcgtgtat 


cctcgatgag 


cctcaaaagt 


tctctcaccc 


cggataagaa 


3300 


acccttaagc 


aatgtgcaaa 


gtttgcattc 


tccactgaca 


taatgcaaaa 


taagatatca 


3360 


tcgatgacat 


agcaactcat 


gcatcatatc 


atgcctctct 


caacctattc 


attcctactc 


3420 


atctacataa 


gtatcttcag 


ctaaatgtta 


gaacataaac 


ccataagtca 


cgtttgatga 


3480 


gtattaggcg 


tgacacatga 


caaatcacag 


actcaagcaa 


gataaagcaa 


aatgatgtgt 


3540 


' acataaaact 


ccagagctat 


atgtcatatt 


gcaaaaagag 


gagagcttat 


aagacaaggc 


3600 


atgactcaca 


aaaattcatt 


tgcctttcgt 


gtcaaaaaga 


ggagggcttt 


acattatcca 


3660 


tgtcatattg 


caaaagaaag 


agagaaagaa 


caacacaatg 


ctgcgtcaat 


tatacatatc 


3720 


tgtatgtcca 


tcattattca 


tccacctttc 


gtgtaccaca 


cttcatatat 


catgagtcac 


3780 


ttcatgtctg 


gacattaaca 


aactctatct 


taacatttag 


atgcaagagc 


ctttatctca 


3840 


ctataaatgc 


acgatgattb 


ctcattgttt 


ctcacaaaaa 


gcattcagtt 


cattagtcct 


3900 


acaacaacga 


attcggcttc 


ccgggtacag 


ggtaaatttc 


tagtttttct 


ccttcatttt 


3960 


cttggttagg 


acccttttct 


ctttttattt 


tfcttgagctt 


tgatctttct 


ttaaactgat 


4020 


ctatttttta 


attgattggt 


tatcgtgtaa 


atattacata 


gctttaactg 


ataatctgat 


4080' 


tactttattt 


cgtgtgtctt 


tgatcatctt 


gatagttaca 


gaaccgtcga 


ctct'agagaa 


4140 


gccatttaaa 


tcgccgccac 


catgtctgtt 


gccttgttat 


gggttgtttc 


tccttgtgac 


4200 


gtctcaaacg ggacaggatt 


cttggtatcc 


gttcgtgagg 


gaaaccggat 


ttttgattcg 


4260 


tcggggcgta 


ggaatttggc 


gtgcaatgag 


agaatcaaga 


gaggaggtgg 


aaaacaaagg 


4320 


tggagttttg gttcttactt gggaggagca 


caaactggaa 


gtggacggaa 


attttctgta 


4380 


cgttctgcta 


tcgtggctac 


tccggctgga 


gaaatgacga 


tgtcatcaga 


acggatggta 


4440 


tatgatgtgg ttttgaggca 


ggcagccttg 


gtgaagagac 


agctgagatc 


gaccgatgag 


4500 


ttagatgtga 


agaaggatat 


acctattccg 


gggactttgg 


gcttgttgag 


tgaagcatat 


4560 


gataggtgta 


gtgaagtatg 


tgcagagtac 


gcaaagacgt 


tttacttagg 


aacgatgcta 


4620 


atgactccgg 


agagaagaaa 


ggctatctgg 


gcaatatacg 


tatggtgcag 


gagaacagac 


4680 


gaacttgttg 


atggtccgaa 


tgcatcacac 


attactccgg 


cggccttaga 


taggtgggaa 


4740 


gacaggctag 


aagatgtttt 


cagtggacgg 


ccatt tgaca 


tgctcgatgc 


tgctttgtcc 


4800 
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gacacagttt ccaaatttcc agttgatatt cagccattca gagatatgat tgaaggaatg 4860 

cgtatggact tgaggaagtc aagatacaga aactttgacg aactatacct atattgttat 4920 

tacgttgctg gtacggttgg gttgatgagt gttccaatta tgggcatcgc acctgaatca 4980 

aaggcaacaa cggagagcgt atataatgct gctttggctt tggggatcgc aaatcagctg 5040 

accaacatac ttagagatgt tggagaagat gccagaagag gaagagtcta tttgcctcaa 5100 

gatgaattag cacaggcagg tctatccgac gaagacatat ttgctggaag agtgaccgat 5160 

aaatggagaa tcttcatgaa gaaacaaatt cagagggcaa gaaagttctt tgacgaggca 5220 

gagaaaggag tgaccgaatt gagcgcagct agtagatggc ctgtgttggc atctctgctg 52 80 

ttgtaccgca ggatactgga cgagatcgaa gccaatgact acaacaactt cacaaagaga 534 0 

gcttatgtga gcaaaccaaa gaagttgatt gcattaccta ttgcatatgc aaaatctctt 5400 

gtgccttcta caagaacatg aaatcaggat tttatataaa tcaaggccaa tgaagccaat 5460 

atacatttag aagaaaaaaa acaagtgttt ataaagtaga attattgaag gggaggcttg 5520 

gagtaactgg taaagttgtt gtcatgtgac tgggaagtca cgggttcaag ccttggaaac 5580 

agcctctggc agaaatgcaa ggtaaggttg cgtacaatat accgttaagg tggggtcctt 5640 

cccagtacao cgcgcatagc gatagattta gtgcaccggg tcgccttttt tctaaagtag 5700 

ggccatgcag gccgatcccc gatcgttcaa acatttggca ataaagtttc ttaagattga 5760 

atcctgttgc cggtcttgcg atgattatca tataatttct gttgaattac gttaagcatg 5820 

taataattaa catgtaatgc atgacgtfcat ttatgagatg ggtttttatg attagagtcc 5880 

cgcaattata catttaatac gcgatagaaa acaaaatata gcgcgcaaac taggataaat 5940 

tatcgcgcgc ggtgtcatct atgttactag atcg sg74 

<210> 8 
<211> 5782 
<212> DHA 

<213> SYNTHETIC - 7650 



<400> 8 

gttaatcatg gtgtaggcaa cccaaataaa acaccaaaat atgcacaagg cagtttgttg 60 

tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt agaaaaggaa 120 

acaatatcat gagtaatgtg tgagcattat gggaccacga aataaaaaga acattttgat iao 

gagtcgtgta tcctcgatga gcctcaaaag ttctctcacc ccggataaga aacccttaag 240 
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caatgtgcaa 


agtttgcatt 


ctccactgac 


ataatgcaaa 


ataagatatc 


atcgatgaca 


300 


tagcaactca 


tgcatcatat 


catgcctctc 


tcaacctatt 


cattcctact 


catctacata 


360 


agtatcttca 


gctaaatgtt 


agaacataaa 


cccataagtc 


acgtttgatg 


agtattaggc 


420 


gtgacacatg 


acaaatcaca 


gactcaagca 


agataaagca 


aaatgatgtg 


tacataaaac 


480 


tccagagcta 


tatgtcatat 


tgcaaaaaga 


ggagagctta 


taagacaagg 


catgactcac 


540 


aaaaattcat 


ttgcctttcg 


tgtcaaaaag 


aggagggctt 


tacattatcc 


atgtcatatt 


600 


gcaaaagaaa 


gagagaaaga 


acaacacaat 


gctgcgtcaa 


ttatacatat 


ctgtatgtcc 


660 


atcattattc 


atccaccttt 


cgtgtaccac 


acttcatata 


tcatgagtca 


cttcatgtct 


720 


ggacattaac 


aaactctatc 


ttaacattta 


gatgcaagag 


cctttatctc 


actataaatg 


780 


cacgatgatt 


tctcattgtt 


tctcacaaaa 


agcattcagt 


tcattagtcc 


tacaacaacg 


840 


aattcggctt 


cccgggtaca 


gggtaaattt 


ctagtttttc 


tccttcattt 


tcttggttag 


900 


gacccttttc 


tctttttatt 


tttttgagct 


ttgatctttc 


tttaaactga 


tctattttfct 


960 


aattgattgg 


ttatcgtgta 


aatattacat 


agctttaact 


gataatctga 


ttactttatt 


1020 


tcgtgtgtct 


ttgatcatct 


tgatagttac 


agaaccgtcg 


actctagaga 


agccatttaa 


1080 


atcgccgcca 


ccatggcttc 


tatgatatcc 


tcttccgctg 


tgacaacagt 


cagccgtgcc 


1140 


tctagggggc 


aatccgccgc 


agtggctcca 


ttcggcggcc 


tcaaatccat gactggattc 


1200 


ccagtgaaga 


aggtcaacac 


tgacattact 


tccattacaa 


gcaatggtgg 


aagagtaaag 


1260 


tgcatggcgg 


ccgccaaacc 


aactacggta 


attggtgcag 


gcttcggtgg 


cctggcactg 


1320 


gcaattcgtc 


tacaagctgc 


ggggatcccc 


gtcttactgc 


ttgaacaacg 


tgataaaccc 


1380 


ggcggtcggg 


cttatgtcta 


cgaggatcag 


gggtttacct 


ttgatgcagg 


cccgacggtt 


1440 


atcaccgatc 


ccagtgccat 


tgaagaactg 


tttgcactgg 


caggaaaaca gttaaaagag 


1500 


tatgtcgaac 


tgctgccggt 


tacgccgttt 


taccgcctgt 




agggaaggtc 


1560 


tttaattacg 


ataacgatca 


aacccggctc 


gaagcgcaga 


ttcagcagtt 


taatccccgc 


1620 


gatgtcgaag 


gttatcgtca 


gtttctggac 


tattcacgcg 


cggtgtttaa 


agaaggctat 


1680 


ctgaagctcg 


gtactgtccc 


ttttttatcg 


ttcagagaca 


tgcttcgcgc 


cgcacctcaa 


1740 


ctggcgaaac 


tgcaggcatg 


gagaagcgtt 


tacagtaagg 


ttgccagtta 


catcgaagat 


1800 


gaacatctgc 


gccaggcgtt 


ttctttccac 


tcgctgttgg 


tgggcggcaa 


tcccttcgcc 


1860 


acctcatcca 


tttatacgtt 


gatacacgcg 


ctggagcgtg 


agtggggcgt 


ctggtttccg 


1920 


cgtggcggca 


ccggcgcatt 


agttcagggg 


atgataaagc 


tgtttcagga 


tctgggtggc 


1980 


gaagtcgtgt 


taaacgccag 


agtcagccat 


atggaaacga 


caggaaacaa 


gattgaagcc 


2040 
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gtgcatttag aggacggtcg caggttcctg acgcaagccg tcgcgtcaaa tgcagatgtg 2100 
gttcatacct atcgcgacct gttaagccag caccctgccg cggttaagca gtccaacaaa 2160 
ctgcagacta agcgcatgag taactctctg tttgtgctct attttggttt gaatcaccat 2220 
catgatcagc tcgcgcatca cacggtttgt ttcggcccgc gttaccgcga gctgattgac 2280 
gaaattttta atcatgatgg cctcgcagag gacttctcac tttatctgca cgcgccctgt 2340 
gtcacggatt cgtcactggc gcctgaaggt tgcggcagtt actatgtgtt ggcgccggtg 2400 
ccgcatttag gcaccgcgaa cctcgactgg acggttgagg ggccaaaact acgcgaccgt 24 60 
atttttgcgt accttgagca gcattacatg cctggcttac ggagtcagct ggtcacgcac 2520 
cggatgttta cgccgtttga ttttcgcgac cagcttaatg cctatcatgg ctcagccttt 2580 
tctgtggagc ccgttcttac ccagagcgcc tggtttcggc cgcataaccg cgataaaacc 2640 
attactaatc tctacctggt cggcgcaggc acgcatcccg gcgcaggcat tcctggcgtc 2 700 
atcggctcgg caaaagcgac agcaggtttg atgctggagg atctgatttg aggtacctcg 2760 
acggccatgc aggccgatcc ccgatcgttc aaacatttgg caataaagtt tcttaagatt 2820 
gaatcctgtt gccggtcttg cgatgattat catataattt ctgttgaatt acgttaagca 2880 
tgtaataatt aacatgtaat gcatgacgtt atttatgaga tgggttttta tgattagagt 2940 
cccgcaatta tacatttaat acgcgataga aaacaaaata tagcgcgcaa actaggataa 3000 
attatcgcgc gcggtgtcat ctatgttact agatcgggcc ttaatcgcaa gcttgttaat 3060 
catggtgtag gcaacccaaa taaaacacca aaatatgcac aaggcagttt gttgtattct 3120 
gtagtacaga caaaactaaa agtaatgaaa gaagatgtgg tgttagaaaa ggaaacaata 3180 
tcatgagtaa tgtgtgagca ttatgggacc acgaaataaa aagaacattt tgatgagtcg 3240 
tgtatcctcg atgagcctca aaagttctct caccccggat aagaaaccct taagcaatgt 3300 
gcaaagtttg catfcctccac tgacataatg caaaataaga tatcatcgat gacatagcaa 3360 
ctcatgcatc atatcatgcc tctctcaacc tattcattcc tactcatcta cataagtatc 3420 
ttcagctaaa tgttagaaca taaacccata agtcacgttt gatgagtatt aggcgtgaca 3480 
catgacaaat cacagactca agcaagataa agcaaaatga tgtgtacata aaactccaga 3540 
gctatatgtc atattgcaaa aagaggagag cttataagac aaggcatgac tcacaaaaat 3600 
tcatttgcct ttcgtgtcaa aaagaggagg gctttacatt atccatgtca tattgcaaaa 3660 
gaaagagaga aagaacaaca caatgctgcg tcaattatac atatctgtat gtccatcatt 3 72 0 
attcatccac ctttcgtgta ccacacttca tatatcatga gtcacttcat gtctggacat 3780 
taacaaactc tatcttaaca tttagatgca agagccttta tctcactata aatgcacgat 3840 
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gatttctcat 


tgtttctcac 


aaaaagcatt 


cagttcatta 


gtcctacaac 


aacgaattcg 


3900 


gcttcccggg 


tacagggtaa 


atttctagtt 


tttctccttc 


attttcttgg 


ttaggaccct 


3960 


tttctctttt 


tatttttttg 


agctttgatc 


tttctttaaa 


ctgatctatt 


ttttaattga 


4020 


ttggttatcg 


tgtaaatatt 


acatagcttt 


aactgataat 


ctgattactt 


tatttcgtgt 


4080 


gtctttgatc 


atcttgatag 


ttacagaacc 


gtcgactcta 


gagaagccat 


ttaaatcgcc 


4140 


gccaccatgt 


ctgttgcctt 


gttatgggtt 


gtttctcctt 


gtgacgtctc 


aaatgggaca 


4200 


agtfctcatgg 


aatcagtccg 


ggagggaaac 


cgtttttttg 


attcatcgag 


gcataggaat 


4260 


ttggtgtcca 


atgagagaat 


caatagaggt 


ggt99 aaa 9C 


aaactaataa 


tggacggaaa 


4320 


ttttctgtac 


ggtctgctat 


tttggctact 


ccatctggag 


aacggacgat 


gacatcggaa 


4380 


cagatggtct 


atgatgtggt 


tttgaggcag 


gcagccttgg 


tgaagaggca 


actgagatct 


4440 


accaatgagt 


tagaagtgaa 


gccggatata 


cctattccgg 


ggaatttggg 


cttgttgagt 


4500 


gaagcatatg 


ataggtgtgg 


tgaagtatgt 


gcagagtatg 


caaagacgtt 


taacttagga 


4560 


actatgctaa 


tgactcccga 


gagaagaagg 


gctatctggg 


caatatatgt 


atggtgcaga 


4620 


agaacagatg 


aacttgttga 


tggcccaaac 


gcatcatata 


ttaccccggc 


agccttagat 


4680 


aggtgggaaa 


ataggctaga 


agatgttttc 


aatgggcggc 


catttgacat 


gctcgatggt 


4740 


gctttgtccg. atacagtttc taactttcca gttgatattc agccattcag 


agatatgatt 


4800 


gaaggaatgc gtatggactt 


gagaaaatcg 


agatacaaaa 


acttcgacga 


actatacctt 


4860 


tattgttatt 


atgttgctgg 


tacggttggg 


ttgatgagtg 


ttccaattat 


gggtatcgcc 


4 920 


cctgaatcaa 


aggcaacaac 


agagagcgta 


tataatgctg 


ctttggctct 


gggg atc g ca 


4980 


aatcaattaa 


ctaacatact 


cagagatgtt 


ggagaagatg 


ccagaagagg 


aagagtctac 


5040 


ttgcctcaag 


atgaattagc 


acaggcaggt 


ctatccgatg 


aagatatatt 


tgctggaagg 


5100 


gtgaccgata 


aatggagaat 


ctttatgaag 


aaacaaatac 


atagggcaag 


aaagttcttt 


5160 


gatgaggcag 


agaaaggcgt 


gacagaattg 


agctcagcta 


gtagattccc 


tgtatgggca 


5220 


tctttggtct 


tgtaccgcaa 


aatactagat 


gagattgaag 


ccaatgacta 


caacaacttc 


5280 


acaaagagag 


catatgtgag 


caaatcaaag 


aagttgattg 


cattacctat 


tgcatatgca 


5340 


aaatctcttg 


tgcctcctac 


aaaaactgcc 


tctcttcaaa 


gataaagcat 


gaaatgaaga 


5400 


tatatatata 


tatatatata 


gcaatataca 


ttagaagaaa 


aaaaggaaga 


agaaatgttg 


5460 


ttgtattgat 


ataaatgtat 


atcataaata 


ttaggttgta 


gtaacattgg 


ccatgcaggc 


5520 


cgatccccga 


tcgttcaaac 


atttggcaat 


aaagtttctt 


aagattgaat 


cctgttgccg 


5580 


gtctfcgcgat 


gattatcata 


taatttctgt 


tgaattacgt 


taagcatgta 


ataattaaca 


5640 



WO 2004/085656 PCT/GB2004/001241 

-28- 

tgtaatgcat gacgttattt atgagatggg tttttatgat tagagtcccg caattataca 5700 

tttaatacgc gatagaaaac aaaatatagc gcgcaaacta ggataaatta tcgcgcgcgg 5760 

tgtcatctat gttactagat eg 5782 

<210> 9 

<211> 5551 

<212> DNA 

<213> SYNTHETIC 



<400> .9 

gttaatcatg gtgtaggcaa cccaaataaa acaccaaaat atgeacaagg cagtttgttg 60 

tattctgtag tacagacaaa actaaaagta atgaaagaag atgtggtgtt agaaaaggaa 120 

acaatatcat gagtaatgtg tgagcattat gggaccacga aataaaaaga acattttgat 180 

gagtcgtgta tcctcgatga gectcaaaag ttctctcacc ceggataaga aacccttaag 240 

caatgtgcaa agtttgcatt ctccactgac ataatgeaaa ataagatatc atcgatgaca 3 00 

tagcaactca tgcatcatat catgcctctc tcaacctatt cattcctact catctacata 360 

agtatcttca gctaaatgtt agaacataaa cccataagtc acgtttgatg agtattaggc 420 

gtgacacatg acaaatcaca gactcaagca agataaagca aaatgatgtg tacataaaac t 480 

tccagagcta tatgtcatat tgcaaaaaga ggagagctta taagacaagg catgactcac 540 

aaaaattcat ttgecttteg tgtcaaaaag aggagggctt tacattatcc atgtcatatt 600 

gcaaaagaaa gagagaaaga acaacacaat getgegtcaa ttatacatat ctgtatgtcc 660 

atcattattc atccaccttt cgtgtaccac acttcatata tcatgagtca cttcatgtct 720 

ggacattaac aaactctatc ttaacattta gatgeaagag cctttatctc actataaatg 780 

cacgatgatt tctcattgtt tctcacaaaa agcattcagt tcattagtcc tacaacaacg 84 0 

aatteggett cccgggtaca gggtaaattt ctagtttttc tccttcattt tcttggttag 900 

gacccttttc tctttttatt tttttgagct ttgatctttc tttaaactga tctatttttt 960 

aattgattgg ttatcgtgta aatattacat agctttaact gataatctga tfcactttatt 1020 

tcgtgtgtct ttgatcatct tgatagttac agaacegteg actctagaga agecatttaa 1080 

atcgccgcca ccatggcttc tatgatatcc tcttccgctg tgacaacagt cagccgtgcc 1140 

tctagggggc aatccgccgc agtggctcca ttcggcggcc tcaaatccat gactggattc 1200 

ccagtgaaga aggtcaacac tgacattact tccattacaa gcaatggtgg aagagtaaag 1260 
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tgcatggcgg ccgccaaacc aactacggta attggtgcag gcttcggtgg cctggcactg 1320 

gcaattcgtc tacaagctgc ggggatcccc gtcttactgc ttgaacaacg tgataaaccc 1380 

ggcggtcggg cttatgtcta cgaggatcag gggtttacct ttgatgcagg cccgacggtt 1440 

atcaccgatc ccagtgccat tgaagaactg tttgcactgg caggaaaaca gttaaaagag 1500 

tatgtcgaac tgctgccggt tacgccgttt taccgcctgt gttgggagtc agggaaggtc 1560 

tttaattacg ataacgatca aacccggctc gaagcgcaga ttcagcagtt taatccccgc 162 0 

gatgtcgaag gttatcgtca gtttctggac tattcacgcg cggtgtttaa agaaggctat 1680 

ctgaagctcg gtactgtccc ttttttatcg ttcagagaca tgcttcgcgc cgcacctcaa 1740 

ctggcgaaac tgcaggcatg gagaagcgtt -tacagtaagg ttgccagtta catcgaagat 1800 

gaacatctgc gccaggcgtt ttctttccac tcgctgttgg tgggcggcaa tcccttcgcc 1860 

acctcatcca tttatacgtt gatacacgcg ctggagcgtg agtggggcgt ctggtttccg 1920 

cgtggcggca ccggcgcatt agttcagggg atgataaagc' tgtttcagga tctgggtggc 198 0 

gaagtcgtgt taaacgccag agtcagccat atggaaacga caggaaacaa gattgaagcc 2 04 0 

gtgcatttag aggacggtcg caggttcctg acgcaagccg tcgcgtcaaa tgcagatgtg 2100 

gttcatacct atcgcgacct gttaagccag caccctgccg cggttaagca gtccaacaaa 2160 

ctgcagacta agcgcatgag taactctctg tttgtgctct attttggttt gaatcaccat 222 0 

catgatcagc tcgcgcatca cacggtttgt ttcggcccgc gttaccgcga gctgattgac 228 0 

gaaattttta atcatgatgg cctcgcagag gacttctcac tttatctgca cgcgccctgt 2340 

gtcacggatt cgtcactggc gcctgaaggt tgcggcagtt actatgtgtt ggcgccggtg 24 00 

ccgcatttag gcaccgcgaa cctcgactgg acggttgagg ggccaaaact acgcgaccgt 24 60 

atttttgcgt accttgagca gcattacatg cctggcttac ggagtcagct ggtcacgcac 2520 

cggatgttta cgccgtttga ttttcgcgac cagcttaatg cctatcatgg ctcagccttt 2580 

tctgtggagc ccgttcttac ccagagcgcc tggtttcggc cgcataaccg cgataaaacc 2640 

attactaatc tctacctggt cggcgcaggc acgcatcccg gcgcaggcat tcctggcgtc 2700 

atcggctcgg caaaagcgac agcaggtttg atgctggagg atctgatttg aggtacctcg 2760 

acggccatgc aggccgatcc ccgatcgttc aaacatttgg caataaagtt tcttaagatt 2820 

gaatcctgtt gccggtcttg cgatgattat catataattt ctgttgaatt acgttaagca 2880 

tgtaataatt aacatgtaat gcatgacgtt atttatgaga tgggttttta tgattagagt 2940 

cccgcaatta tacatttaat acgcgataga aaacaaaata tagcgcgcaa actaggataa 3 000 

attatcgcgc gcggtgtcat ctatgttact agatcgggcc ttaatcgcaa gcttgttaat 3 060 
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catggtgtag gcaacccaaa taaaacacca aaatatgcac aaggcagttt gttgtattct 3120 
gtagtacaga caaaactaaa agtaatgaaa gaagatgtgg tgttagaaaa . ggaaacaata 3180 
tcatgagtaa tgtgtgagca ttatgggacc acgaaataaa aagaacattt tgatgagtcg 3240 
tgtatcctcg atgagcctca aaagttctct caccccggat aagaaaccct taagcaatgt 3300 
gcaaagtttg cattctccac tgacataatg caaaataaga tatcatcgat gacatagcaa 3360 
ctcatgcatc atatcatgcc tctctcaacc tattcattcc tactcatcta cataagtatc 3420 
ttcagctaaa tgttagaaca taaacccata agtcacgttt gatgagtatt aggcgtgaca 3480 
catgacaaat cacagactca agcaagataa agcaaaatga tgtgtacata aaactccaga 3540 
gctatatgtc atattgcaaa aagaggagag cttataagac aaggcatgac tcacaaaaat 3 600 
tcatttgcct ttcgtgtcaa aaagaggagg gctttacatt atccatgtca tattgcaaaa 3 660 
gaaagagaga aagaacaaca caatgctgcg tcaattatac atatctgtat gtccatcatt 3720 
attcatccac ctttcgtgta ccacacttca tatatcatga gtcacttcat gtctggacat 3780 
taacaaactc tatcttaaca tttagatgca agagccttta tctcactata aatgcacgat 3 84 0 
gatttctcat tgtttctcac aaaaagcatt cagttcatta gtcctacaac aacgaattcg 3 900 
gcttcccggg tacagggtaa atttctagtt tttctccttc attttcttgg ttaggaccct 3 960 
tttctctttt tatttttttg agctttgatc tttctttaaa ctgatctatt ttttaattga 402 0 
ttggttatcg tgtaaatatt acatagcttt aactgataat ctgattactt tatttcgtgt 4 08 0 
gtctttgatc atcttgatag ttacagaacc gtcgactcta gagaagccat ttaaatcgcc 4140 
gccaccatgg cttctatgat atcctcttcc gctgtgacaa cagtcagccg tgcctctagg 4200. 
gggcaatccg ccgcagtggc tccattcggc ggcctcaaat ccatgactgg attcccagtg 4260 
aagaaggtca acactgacat tacttccatt acaagcaatg gtggaagagt aaagtgcatg 4320 
gcagttggct cgaaaagttt tgcgacagcc tcaaagttat ttgatgcaaa aacccggcgc 4380 
agcgtactga tgctctacgc ctggtgccgc cattgtgacg atgttattga cgatcagacg 4440 
ctgggctttc aggcccggca gcctgcctta caaacgcccg aacaacgtct gatgcaactt 45 00 
gagafcgaaaa cgcgccaggc ctatgcagga tcgcagatgc acgaaccggc gtttgcggct 4560 
tttcaggaag tggctatggc tcatgatatc gccccggctt acgcgtttga tcatctggaa 4620 
ggcttcgcga tggatgtacg cgaagcgcaa tacagccaac tggatgatac gctgcgctat 4 680 
tgctatcacg ttgcaggcgt tgtcggcttg atgatggcgc aaatcatggg cgtgcgggat 4740 
aacgccacgc tggaccgcgc ctgtgacctt gggctggcat ttcagttgac caatattgct 4800 
cgcgatattg tggacgatgc gcatgcgggc cgctgttatc tgccggcaag ctggctggag 4860 
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catgaaggtc 


tgaacaaaga 


gaattatgcg gcacctgaaa accgtcaggc gcfcgagccgt 


4920 


atcgcccgac 


gtttggtgca 


ggaagcagaa 


ccttactatt 


tgtctgccac 


agccggcctg 


4980 


gcagggttgc 


ccctgcgttc 


cgcctgggca 


atcgctacgg 


cgaagcaggt 


ttaccggaaa 


5040 


ataggtgtca 


aagttgaaca 


ggccggtcag 


caagcctggg 


atcagcggca 


gtcaaegacc 


5100 


acgcccgaaa 


aattaacgct 


gctgctggcc 


gcctctggtc 


aggcccttac 


ttcccggatg 


5160 


cgggctcatc 


ctccccgccc 


tgcgcatctc 


tggcagcgcc 


cgctctaggg 


atccgttaag 


5220 


ggcgaattcc 


agcacactgg 


cggccgttac 


tagtggatcc 


gagctcggta 


cctcgacggc 


5280 


catgcaggcc 


gatccccgat 


cgttcaaaca 


tttggcaata 


aagtttctta 


agattgaatc 


5340 


ctgttgccgg 


tcttgcgatg 


attatcatat 


aatttctgtt 


gaattacgtt 


aagcatgtaa 


5400 


taattaacat 


gtaatgcatg 


acgttattta 


tgagatgggt 


ttttatgatt 


agagtcccgc 


5460 


aattatacat 


ttaatacgcg 


atagaaaaca 


aaatatagcg 


cgcaaactag 


gataaattat 


5520 


cgcgcgcggt 


gtcatctatg 


ttactagatc 


g 






5551 



<210> 10 

<211> 1233 

<212> DNA 

<213> Zea mays 



<400> 10 
atggccatca 


tactcgtacg 


agcagcgtcg 


ccggggctct 


ccgccgccga 


cagcatcagc 


60 


caccagggga 


ctctccagtg ctccaccctg 


ctcaagacga 


a 9 a gg cc 99 c 


ggcgcggcgg 


120 


tggatgccct 


gctcgctcct 


tggcctccac 


ccgtgggagg 


ctggccgtcc 


ctcccccgcc 


180 


gtctactcca 


gcctgcccgt 


caacccggcg 


ggagaggccg 


tcgtctcgtc 


cgagcagaag 


240 


gtctacgacg 


tcgtgctcaa 


gcaggccgca 


ttgctcaaac 


gccagctgcg 


cacgccggtc 


3 00 


ctcgacgcca 


ggccccagga 


catggacatg 


ccacgcaacg ggctcaagga 


agcctacgac 


360 


cgctgcggcg 


agatctgtga 


ggagtatgcc 


aagacgtttt 


acctcggaac 


tatgttgatg 


420 


acagaggagc 


ggcgccgcgc 


catatgggcc 


atctatgtgt ggtgtaggag gacagatgag 


480 


cttgtagatg 


ggccaaacgc 


caactacatt 


acaccaacag 


ctttggaccg 


gtgggagaag 


540 


agacttgagg 


atctgttcac 


gggacgtcct 


tacgacatgc 


ttgatgccgc 


tctctctgat 


600 


accatctcaa 


ggttccccat 


agacattcag 


ccattcaggg 


acatgattga 


agggatgagg 


660 


agtgatctta 


ggaagacaag 


gtataacaac 


ttcgacgagc 


tctacatgta 


ctgctactat 


720 
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gttgctggaa ctgtcgggtt aatgagcgta cctgtgatgg gcatcgcaac cgagtctaaa 780 

gcaacaactg aaagcgtata cagtgctgcc ttggctctgg gaattgcgaa ccaactcacg 840 

aacatactcc gggatgttgg agaggatgct agaagaggaa ggatatattt accacaagat 90 0 

gagcttgcac aggcagggct ctctgatgag gacatcttca aaggggtcgt cacgaaccgg 960 

tggagaaact tcatgaagag gcagatcaag agggccagga tgttttttga ggaggcagag 1020 

agaggggbaa ctgagctctc acaggctagc agatggccag tatgggcttc cctgttgttg 1080 

tacaggcaga tcctggatga gatcgaagcc aacgactaca acaacttcac gaagagggcg 114 0 

tatgttggta aagggaagaa gttgctagca cttcctgtgg catafcggaaa atcgctactg 1200 

ctcccatgtt cattgagaaa tggccagacc tag 2233 

<210> 11 
<211> 1233 
<212> DNA 
<213> Zea mays 

<400> 11 

atggccatca tactcgtacg agcagcgtcg ccggggctct ccgccgccga cagcatcagc 60 

caccagggga ctctccagtg ctccaccctg ctcaagacga agaggccggc ggcgcggcgg 120 

tggatgccct gctcgctcct tggcctccac ccgtgggagg ctggccgtcc ctcccccgcc 180 

gtctactcca gcctgcccgt caacccggcg ggagaggccg tcgtctcgtc cgagcagaag 240 

gtctacgacg tcgtgctcaa gcaggccgca ttgctcaaac gccagctgcg cacgccggtc 300 

ctcgacgcca ggccccagga catggacatg ccacgcaacg ggctcaagga agcctacgac 360 

cgctgcggcg agatctgtga ggagtatgcc aagacgtttt acctcggaac tatgttgatg 420 

acagaggagc ggcgccgcgc catatgggcc atctatgtgt ggtgtaggag gacagatgag 48 0 

cttgtagatg ggccaaacgc caactacatt acaccaacag ctttggaccg gtgggagaag 540 

agacttgagg atctgttcac gggacgtcct tacgacatgc ttgatgccgc tctctctgat 600 

accatctcaa ggttccccat agacattcag ccattcaggg acatgattga agggatgagg 660 

agtgatctta ggaagacaag gtataacaac ttcgacgagc tctacatgta ctgctactat 720 

gttgctggaa ctgtcgggtt aatgagcgta cctgtgatgg gcatcgcaac cgagtctaaa 780 

gcaacaactg aaagcgtata cagtgctgcc ttggctctgg gaattgcgaa ccaactcacg 840 

aacatactcc gggatgttgg agaggatgct agaagaggaa ggatatattt accacaagat 900 
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gagcttgcac aggcagggct ctctgatgag gacatcttca aaggggtcgt cacgaaccgg 960 

tggagaaact tcatgaagag gcagatcaag agggccagga tgttttttga ggaggcagag 1020 

agaggggtaa atgagctctc acaggctagc agatggccag tatgggcttc cctgttgttg 1080 

tacaggcaga tcctggatga gatcgaagcc aacgactaca acaacttcac gaagagggcg 114 0 

tatgttggta aagggaagaa gttgctagca cttcctgtgg catatggaaa atcgctactg 1200 

ctcccatgtt cattgagaaa tggccagacc tag 1233 

<210> 12 

<211> 1233 

<212> DNA 

<213> Zea mays 



<400> 12 

atggccatca tactcgtacg agcagcgtcg ccggggctct ccgccgccga cagcatcagc 60 

caccagggga ctctccagtg ctccaccctg ctcaagacga agaggccggc ggcgcgccgg 120 

tggatgccct gcticgctcct tggcctccac ccgtgggagg ctggccgtcc ctcccccgcc 180 

gtctactcca gcctcgccgt caacccggcg ggagaggccg tcgtctcgtc cgagcagaag 24 0 

gtctacgacg tcgtgctcaa gcaggccgca ttgctcaaac gccagctgcg cacgccggtc 3 00 

ctcgacgcca ggccccagga catggacatg ccacgcaacg ggctcaagga agcctacgac 3 60. 

cgctgcggcg agatctgtga ggagtatgcc aagacgtttt acctcggaac tatgttgatg 42 0 

acagaggagc ggcgccgcgc catatgggcc atctatgtgt ggtgtaggag gacagatgag 4 80 

cttgtagatg ggccaaacgc caactacatt acaccaacag ctttggaccg gtgggagaag 54 0 

agacttgagg atctgttcac gggacgtcct tacgacatgc ttgatgccgc tctctctgat 600 

accatctcaa ggttccccat agacattcag ccattcaggg acatgattga agggatgagg 660 

agtgatctta ggaagacaag gtataacaac ttcgacgagc tctacatgta ctgctactat 720 

gttgctggaa ctgtcgggtt aatgagcgta ccagtgatgg gcatcgcatc cgagtctaaa 780 

gcaacaactg aaagcgtgta cagtgctgcc ttggctctcg gaattgcgaa ccaactcacg 840 

aacatactcc gggatgttgg agaggatgct agacgaggaa ggatatattt accacaagat 900 

gagcttgcac aggcagggct ctctgatgag gacatcttca aaggggtcgt cacgaaccgg 960 

tggagaaact tcatgaagag gcagatcaag agggccagga tgttttttga ggaggcagag 1020 

agaggggtaa ctgagctctc acaggctagc agatggccag tatgggcttc cctgttgttg 1080 
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tacaggcaga tcctggatga gatcgaagcc aacgactaca acaacttcac gaagagggcg 1140 

tatgttggta aagggaagaa gttgctagca cttcctgtgg catatggaaa atcgctactg 1200 

ctcccatgtt cattgagaaa tggccagacc tag 1233 

<210> 13 

<211> 1263 

<212> DNA 

<213> Oryga sp. 

<400> 13 

atggcggcca tcacgctcct acgttcagcg tctcttccgg gcctctccga cgccctcgcc 60 

cgggacgctg ctgccgtcca acatgtctgc tcctcctacc tgcccaacaa caaggagaag 120 

aagaggaggt ggatcctctg ctcgctcaag tacgcctgcc ttggcgtcga ccctgccccg 180 

ggcgagattg cccggacctc gccggtgtac tccagcctca ccgtcacccc tgctggagag 240 

gccgtcatct cctcggagca gaaggtgtac gacgtcgtcc tcaagcaggc agcattgctc 300 

aaacgccacc tgcgcccaca ■ accacacacc attcccatcg ttcccaagga cctggacctg 360 

ccaagaaacg gcctcaagca -ggcctatcat cgctgcggag agatctgcga ggagtatgcc 420 

aagacctttt accttggaac tatgctcatg acggaggacc gacggcgcgc catatgggcc 4 80 

atctatgtgt ggtgtaggag gacagatgag cttgtagatg gaccaaatgc ctcgcacatc 540 

acaccgtcag ccctggaccg gtgggagaag aggcttgatg atctcttcac cggacgccoc 600 

tacgacatgc ttgatgctgc actttctgat accatctcca agtttcctat agatattcag 660 

cctttcaggg acatgataga agggatgcgg tcagacctca gaaagactag atacaagaac 720 

ttcgacgagc tctacatgta ctgctactat gttgctggaa ctgtggggct aatgagtgtt 780 

cctgtgatgg gtattgcacc cgagtcgaag gcaacaactg aaagtgtgta cagtgctgct 840 

ttggctctcg gcattgcaaa ccagctcaca aatatactcc gtgacgttgg agaggacgcg 900 

agaagaggga ggatatattt accacaagat gaacttgcag aggcagggct ctctgatgag 960 

gacatcttca atggcgttgt gactaaca'aa tggagaagct tcatgaagag acagatcaag 1020 

agagctagga tgttttttga ggaggcagag agaggggtga ccgagctcag ccaggcaagc 1080 

cggtggccgg tctgggcgtc tctgttgtta taccggcaaa tccttgacga gatagaagca 1140 

aacgattaca acaacttcac aaagagggcg tacgttggga aggcgaagaa attgctagcg' 1200 

cttccagttg catatggtag atcattgctg atgccctact cactgagaaa tagccagaag 1260 
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thus modified sequence to hybridise to a sequence in the Sequence Listing under the 
conditions described above. 

The present invention further provides a plant which comprises a polynucleotide 
or polynucleotide sequence as described above. In a particular embodiment said plant is 
a rice or a maize plant. 

In a further aspect of the present invention there is provided a polynucleotide or a 
polynucleotide sequence as described above wherein the promoter(s) are tissue preferred 
and/or organ preferred. In a particular embodiment the promoter(s) provide for 
prefferential expression in fruit. In a further embodiment said promoter(s) provide for 
high expression in fruit. In a still further embodiment said fruit is a banana fruit. 
In a further aspect of the present invention there is provided a polynucleotide comprising: 
(a) a region which comprises as operably linked components (i) a promoter which 
provides for fruit preferred expression; and (ii) a nucleotide sequence derived from a 
bacterium which sequence encodes a carotene desaturase; and (iii) a transcription 
termination region; and (b) a further region which comprises as operably linked 
components (i) a promoter which provides for fruit preferred expression; and (ii) a 
nucleotide sequence encoding a phytoene synthase which sequence is derived from maize 
(Zea sp.) or rice (Orzya sp.); and (iii) a transcription termination region. 
In a still further embodiment there is provided a method for increasing the carotenoid 
content of fruit comprising inserting into plant material a polynucleotide comprising: 
(a) a region which comprises as operably linked components (i) a promoter which 
provides for fruit preferred expression; and (ii) a nucleotide sequence derived from a 
bacterium which sequence encodes a carotene desaturase; and (iii) a transcription 
termination region; and (b) a further region which comprises as operably linked 
components (i) a promoter which provides for fruit preferred expression; and (ii) a 
nucleotide sequence encoding a phytoene synthase which sequence is derived from maize 
(Zea sp.) or rice (Orzya sp.); and (iii) a transcription termination region and regenerating 
a fruit-containing plant from said material and identifying the fruit which contain 
carotenoids at levels greater than those of control like-fruit The present invention still 
further provides polynucleotides v/hich comprise the phytoene synthase encoding 
sequences and carotene desaturase encoding sequences mentioned above which 
sequences are operably linked to promoters which provide for fruit preferred or tissue or 
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organ preferred expression. Such suitable promoters may be identified by the person 
skilled in the art. 

In a further aspect of the present invention there is provided the use of a 
polynucleotide or a polynucleotide sequence as described above in a method for the 
5 production of plants which are resistant and/or tolerant to a herbicide. 

In a still further aspect of the present invention there is provided a method for the 
production of a plant that is resistant and/or tolerant to a herbicide comprising inserting 
into plant material a polynucleotide or a polynucleotide sequence as described above and 
regenerating a morphologically normal plant from said material. The herbicide 
10 resistance and/or tolerance of the plant containing the polynucleotide or polynucleotide 
sequence of the invention can be compared to a control like-plant. The term control like- 
plant relates to plants which are substantially similar to those according to the invention 
but which control like-plant does not contain the polynucleotides or polynucleotide 
sequences according^ the invention. Typically, a control like-plant will comprise a 
15 plant of the same or similar plant species which control like-plant is a native plant or 
which has not been transformed. 

In a further aspect of the present invention there is provided a polynucleotide 
comprising the sequence depicted as SEQ ID NO: 13. In a particular embodiment there 
is provided a polynucelotide which consists of the sequence depicted as SEQ ID NO: 13. 
20 The present invention still further provides a polynucleotide which encodes the protein 
depicted as SEQ ID NO: 14. The present invention still further provides a polynucleotide 
sequence which has at least 87% identity to the sequence depicted as SEQ ID NO: 13 
wherein said sequence still encodes a phytoene synthase. The present invention still 
further provides a polynucleotide sequence which has at least 90% identity to the 
25 sequence depicted as SEQ ID NO: 13 wherein said sequence still encodes a phytoene 
synthase. The present invention still further provides a polynucleotide sequence which 
has at least 91% identity to the sequence depicted as SEQ ID NO: 13 wherein said 
sequence still encodes a phytoene synthase. The present invention still further provides a 
polynucleotide sequence which has at least 92% identity to the sequence depicted as SEQ 
30 ID NO: 13 wherein said sequence still encodes a phytoene synthase. The present 

invention still further provides a polynucleotide sequence which has at least 93% identity 
to the sequence depicted as SEQ ID NO: 13 wherein said sequence still encodes a 
phytoene synthase. The present invention still further provides a polynucleotide 
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sequence which has at least 94% identity to the sequence depicted as SEQ ID NO: 13 
wherein said sequence still encodes a phytoene synthase. The present invention still 
further provides a polynucleotide sequence which has at least 95% identity to the 
sequence depicted as SEQ ID NO: 13 wherein said sequence still encodes a phytoene 
synthase. The present invention still further provides a polynucleotide sequence which 
has at least 96% identity to the sequence depicted as SEQ ID NO: 13 wherein said 
sequence still encodes a phytoene synthase. The present invention still further provides a 
polynucleotide sequence which has at least 97% identity to the sequence depicted as SEQ 
ID NO: 13 wherein said sequence still encodes a phytoene synthase. The present 
invention still further provides a polynucleotide sequence which has at least 98% identity 
to the sequence depicted as SEQ ID NO: 13 wherein said sequence still encodes a 
phytoene synthase. The present invention still further provides a polynucleotide 
sequence which has at least 99% identity to the sequence depicted as SEQ ID NO: 13 
wherein said sequence still encodes a phytoene synthase. 

The present invention still further provides a protein having the sequence 
deipicted as SEQ ID NO: 14 or a variant which has at least 82% identity to SEQ 3D NO: 
14 wherein said variant still provides for phytoene synthase activity. In a further 
embodiment said variant has at least 85% identity to SEQ ID NO: 14 wherein said 
variant still provides for phytoene synthase activity. In a still further embodiment said 
variant has at least 90% identity to SEQ ID NO: 14 wherein said variant still provides for 
phytoene synthase activity. In a still further embodiment said variant has at least 91% 
identity to SEQ ID NO: 14 wherein said variant still provides for phytoene synthase 
activity. In a still further embodiment said variant has at least 92% identity to SEQ ID 
NO: 14 wherein said variant still provides for phytoene synthase activity. In a still further 
embodiment said variant has at least 93% identity to SEQ ID NO: 14 wherein said 
variant still provides for phytoene synthase activity. In a still further embodiment said 
variant has at least 94% identity to SEQ ID NO: 14 wherein said variant still provides for 
phytoene synthase activity. In a still further embodiment said variant has at least 95% 
identity to SEQ ID NO: 14 wherein said variant still provides for phytoene synthase 
activity. In a still further embodiment said variant has at least 96% identity to SEQ ID 
NO: 14 wherein said variant still provides for phytoene synthase activity. In a still further 
embodiment said variant has at least 97% identity to SEQ ID NO: 14 wherein said 
variant still provides for phytoene synthase activity. In a still further embodiment said 
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variant has at least 98% identity to SEQ ID NO: 14 wherein said variant still provides for 
phytoene synthase activity. In a still further embodiment said variant has at least 99% 
identity to SEQ ID NO: 1 4 wherein said variant still provides for phytoene synthase 
activity. In a still further embodiment there is provided a polynucleotide which encodes 
5 said variant. By phytoene synthase activity it is meant that the variant protein has the 
same or a similar function to the protein depicted as SEQ ID NO: 14. The percentage of 
sequence identity for proteins is determined by comparing two optimally aligned 
sequences over a comparison window, wherein the portion of the amino acid sequence in 
the comparison window may comprise additions or deletions (i.e. gaps) as compared to 
1 0 the initial reference sequence (which does not comprise additions or deletions) for 

optimal alignment of the two sequences. The percentage is calculated by determining the 
number of positions at which the identical amino acid residue occurs in both sequences to 
yield the number of match positions, dividing the number of match positions by the total 
number of positions in the window of comparison and multiplying thq result by 100 to 
1 5 yield the percentage of sequence identity. Optimal alignment of sequences for 
comparison may also be conducted by computerised implementations of known 
algorithms such as Altschul, Stephen R, Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped 
BLAST and PSI-BLAST: a new generation of protein database search programs", 
20 Nucleic Acids Res. 25:3389-3402. There are also algorithms available to the person 
skilled in the art that enable a calculation of the percentage sequence identity between 
polynucleotide sequences. The variant may differ from the protein depicted as SEQ ID 
NO: 14 in particular by conservative substitutions. Such conservative substitutions are 
described above. 

25 The present invention will now be described by way of the following non- 

limiting examples with reference to the following Figures and Sequence Listing of 
which: 



SEQ ID NO: 1 = 12423 = Glu-Cat-SSU-crtl-Nos-Glu-Cat-Psy (Maize-gb)-nc 
30 SEQ ID NO: 2 = 12421 = Glu-Cat-SSU-crtl-Nos-Glu-Cat-Psy (Maize-ElB)-3 
SEQ ID NO: 3 = 12422 = Glu-SSU-crtl-Nos-Glu-Psy (Maize-ElB)-nos 
SEQ ID NO: 4 = 12424 = Glu-SSU-crtl-Nos-Glu-Psy (Maize-gb)-nos 
SEQ ID NO: 5 = Glu-Cat-SSU-crtl-Nos-Glu-Cat-Psy (Maize)-nos 
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SEQ ID NO: 6 = 1 1586 = Glu-Cat-SSU-crtl-Nos-Glu-Cat-Psy (Rice)-nos 
SEQ ID NO: 7 = 7651= Glu-Cat-SSU-crtl-Nos-Glu-Cat-Psy (Pepper)-nos 
SEQ ID NO: 3 = 7650= Glu-Cat-SSU-crtl-Nos-Glu-Cat-Psy (Tomato)-nos 
SEQ ID NO: 9 = Glu-Cat-SSU-crtl-Nos-Glu-Cat-SSU-Psy (crtB)-nos 
SEQ ID NO: 1 0 = Phytoene synthase gb (Maize) 

SEQ ID NO: 1 1 = Phytoene synthase (Maize) from SEQ ID NO 5 above 

SEQ ID NO: 12 = Phytoene synthase E1B (Maize) 

SEQ ED NO: 13 = Phytoene synthase (Rice) 

SEQ ID NO: 14 = Phytoene synthase (Rice) PROTEIN 

SEQ ID NO: 15 = Phytoene synthase (Pepper) , 

SEQ ID NO: 16 = Phytoene synthase (Tomato) 

SEQ ID NO: 17 = Phytoene synthase (Erwinia crtB) 

SEQ ID NO: 18 = Carotene desaturase (Erwinia crtl) used in SEQ ID NOS: 1-4 

SEQ ID NO: 19 = Carotene desaturase (Erwinia crtl) 

SEQ ID NO: 20 = Glutelin seed preferred promoter 

SEQ ID NO: 21 = Prolamin seed preferred promoter 

SEQ ID NO: 22 = Intron from catalase gene 

SEQ ID NO: 23 = Transit peptide (Small sub-unit Rubisco) 

SEQ ID NO: 24 = Transcription termination region from nopaline synthase gene 

SEQ ID NO: 25 = Transcription termination region from 35S CaMV 

SEQ ID NO: 26 = Transcription termination region from proteinase inhibitor form potato 

SEQ ID NO: 27 = Carotene desaturase (Tomato) 

SEQ ID NO: 28 = Carotene desaturase (Pepper) 

SEQ ID NO: 29= Carotene desaturase (Maize) 

SEQ ID NO: 30 = Carotene desaturase (Rice) 

SEQ ID NO: 31 = Zeta-carotene desaturase (Tomato) 

SEQ ID NO: 32 = Zeta-carotene desaturase (Pepper) 

SEQ ID NO: 33 = Zeta-carotene desaturase (Maize) 

SEQ ID NO: 34 = Zeta-carotene desaturase (Rice) 

SEQ ID NOS: 35 to 38 = Primers 



Glu = Glutelin seed preferred promoter 
Pro = Prolamin seed preferred promoter 
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Cat = Intron from catalase gene 
SSU = Transit peptide (Small sub-unit Rubisco) 
CrtI = Carotene desaturase from Ervvinia 
Pds = Carotene desaturase (source indicated in parenthesis) 
Psy = Phytoene synthase (source indicated in parenthesis) 
Zds = Zeta-carotene desaturase 
Pds = Phytoene desaturase 

Nos = Transcription termination region from nopaline synthase gene 
35S term = Transcription termination region from 35S CaMV 

PotPl-H term = Transcription termination region from proteinase inhibitor form potato 

FIGURE 1 = Part of carotenoid biosynthesis pathway starting from GGPP 
(Geranylgeranyl diphosphate). 
FIGURE 2 = Construct pPRPOl 17. 

EXAMPLES 

General molecular biology methods are carried out according to Sambrook et al 
(1989) 'Molecular cloning: A laboratory Manual, 2nd Edition. Cold Spring Harbour Lab. 
Press. 

LQ. Construction of plant binary vectors 

References for the gene sequences below are given as they are listed on the EMBL 
database. This database is maintained and distributed by the European Bioinformatics 
Institute (Patricia Rodriguez-Tome , Peter J. Stoehr , Graham N. Cameron and Tomas P. 
Flores, "The European Bioinformatics Institute (EBI) databases", Nucleic Acids Res. 
24.(6-13), 1996, www.ebi.ac.uk.) 

A pUC based vector pPRPOl 17 (Figure 2) was used for cloning of all the plant 
transformation vectors. This contains the nucleotides -806 to +33 of the rice glutelin 
gene as a promoter CY006S7), the first intron of the catalase-1 gene from castor bean 
altered to remove the ATG sequences, a gus coding region and a nos terminator. The gus 
coding sequence was removed by digestion of pPRPOl 17 with Ncol and Sfi 1 and 
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replaced by the coding regions of the carotenoid phytoene synthase genes or phytoene 
desaturase genes, or removed as a gus::nos cassette by digestion of pPRPOl 17 with Ncol 
and Pacl and replaced by a carotenoid coding region/ nos terminator fusion as described 
below. 

5 

1 . 1 Construction of the pPRPO 1 1 7 +crtl vector 

A cassette of the signal peptide of the small subunit of pea ribulose bisphosphate 
carboxylase (SSU) (X00806) fused to the bacterial phytoene desaturase Crtl (D90087) 
and a nos terminator v/as cloned into the Ncol and Pacl sites of pPRPOl 17. The crtl 
10 sequence in constructs 7651, 7650 and 1 15S6 had 9 nucleotides extra (3 alanines) 
inserted after the first ATG in order to incorporate a NotI restriction site for cloning 
purposes. 

1 .2 Construction of the pJHO 1 04HvgCrtI binary vector 

15 The Sgfl Gt::intron::SSUcrtI::n6s cassette was cloned into the Pad site of the binary 
vector pJH0104+Hyg (which contains a hygromycin resistance gene for antibiotic 
selection) to give the construct pJH0104HygSSUCrtI. 

1 .3 Pepper t>sy + crtl construct 765 1 

20 The catalase intron and pepper psy (X68017) were separately amplified by PCR using 
primers that overlap both sequences, and then fused by recombinant PCR, and cloned 
into the EcoRI and Sfil sites of pPRP0117. The Gt::intron::pepper psy::nos cassette was 
recovered with Sgfl digestion and cloned into the Pad site of pJH0104HygCrtI to give 
the construct 7651. 

25 

1 .4 Tomato psy + crtl construct 7650 

The catalase intron and tomato psy (Y00521) were separately amplified by PCR using 
primers that overlap both sequences, and then fused by recombinant PCR and cloned into 
the EcoRI and Sfil sites of pPRP0117. The Gt::intron::tomato psynnos cassette was 
30 recovered by Sgfl and cloned into the Pad site of pJHO 1 04HygCrtI to give the construct 
7650. 
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1 .5 Rice psv ± crtl construct 1 1 586 

PolyA mRNA was extracted from rice leaves (Asanohikari). First strand synthesis of 
Rico psy cDNA was synthesized using the antisense primer 5 7 
cgtcggcctgcatggccctacttctggctatttctcagtg V (SEQ ID NO: 35) and cDNA was then 

5 obtained by PCR amplification with this antisense primer and the sense primer 

5'ctgtccatggcggccatcacgctcct 3' (SEQ ID NO: 36). This was digested with Ncol and Sfil 
and cloned into pFRPOl 17. The Gt::intron::rice psy::nos fragment was transferred to the 
binary vector pJH0104Hyg. A Hindlll/PacI Gt::intron::SSUcrtI::nos cassette was blunt 
ended and cloned into the Pmel site of the pJH0104HygRicepsy vector to give the 

) construct 11586. 

1.6 Maize psv ( E 1B) + crtl construct 12421 

PolyA mRNA was extracted from maize leaves. First strand synthesis of maize psy 
(sequence designated "E1B") cDNA was synthesized using the antisense primer 5' 
cgatggcctgcatggccctaggtctggccatttctcaatg 3' (SEQ ID NO: 37) and cDNA was then 
obtained by PCR amplification with this antisense primer and the sense primer 5' 
taggataagatagcaaatccatggccatcata 3' (SEQ ID NO: 38), This was digested with Ncol and 
Sfil and cloned into the Ncol and Sfil sites of a pPRPOl 17 based vector. The 
Gt::intron: maize psy::nos cassette was recovered with Hindm/Pacl digestion and cloned 
into a binary vector containing a Gt::intron::SSUcrtI::nos cassette to give the construct 
12421. 

1.7 Maize psv fElB) + crtl construct 12422 

The vector with the Gt::intron::maize psy::nos cassette in the pPRPOl 17 backbone (from 
construction of 12421 above) was digested with the restriction enzymes flanking the 
intron followed by religation of the vector in order to remove the catalase intron. The 
Gt::maize psy::nos cassette was recovered with Hindm/Pacl digestion and cloned into a 
binary vector containing a Gt::SSUcrtI::nos cassette to give the construct 12422. 

1 . 8 Maize ps v + crtl construct 1 2423 

The Yl maize psy (U32636) cds sequence was synthesised with Ncol and Sfil restriction 
sites added at the 5' and 3' end respectively. This was cloned into the Ncol and Sfil 
sites of a pPRPOl 17 based vector. The Gt::intron::maize psy::nos cassette was recovered 
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with HindlH/Pacl digestion and cloned into a binary vector containing a 
Gt::intron::SSUcrtI::nos cassette to give the construct 12423. 

1 .9 Maize psv + crtl construct 1 2424 

The vector with the Gt::intron::maize psy::nos cassette in the pPRPOl 17 backbone (from 
construction of 12423 above) was digested with the restriction enzymes flanking the 
intron followed by religation of the vector in order to remove the catalase intron. The 
Gt::maize psy::nos cassette was recovered with Hindm/Pacl digestion and cloned into a 
binary vector containing a Gt::SSUcrtI::nos cassette to give the construct 12424. 

2:0 Construction of vectors for plant transformation. 

When providing vectors for plant transformation which utilise Agrobacterium 9 it is 
preferred that the sequences according to the invention are inserted between the border 
regions of a single TDNA region. Agrobacterium may be transformed in accordance 
with methods which are well known to the person skilled in the art, and/or via the 
methods disclosed herein. 

3 . 0 Transform ationofRice 

Based on the protocol published by Hiei et al (1994 The Plant Journal, 6 (2), 271-282). 
The key modification involves use of a supervirulent strain in combination with a 
standard binary vector. 
The basic procedure is as follows: 

Mature rice seed (Asanohikari) are de-husked and surface sterilised by 70% ethanol for 
one minute followed by 4% Sodium hypochlorite + Tween for 30 minutes. Seed are then 
sown on a callus induction media (CIM) (N6 salts, N6 vitamins, 30 g/1 sucrose, 1 g/1 
casein hydrolysate, 2 mg/1 2,4-D, pH 5.8, 4 g/1 Gelrite) and placed in the dark at 30°C. 
After 3 weeks embryogenic calli are isolated and plated on CIM and placed under the 
same conditions. At the same time Agrobacterium cultures (AGL1 strain plus binary) are 
established by spreading inoculum onto LB plates plus Kanamycin (50 mg/1). After three 
days, Agrobacterium is scraped from the plate and re-suspended in AA1 + AS (AA salts, 
B5 vitamins, AA Amino Acids, 68.5 g/1 sucrose, 36 g/1 glucose, 0.5 g/1 casein 
hydrolysate, 100 \xM Acetosyringone, pH 5.2) to an optical density of 0.1 at 600 nm. The 
embryogenic calli is inoculated with the Agrobacterium solution for 10 minutes after 
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which the calli are spread on to plates of R2COMAS (R2 Micro salts, !4 R2 Macro salts, 
B5 Vitamins, 20 g/1 sucrose, 10 g/1 glucose, 1 g/1 casein hydrolysate, 2 mg/1 2,4-D, 100 
MM Acetosyringone, pH 5.2) and placed in the dark at 26°C. After three days calli are 
transferred on to selection media (N6 salts, N6 vitamins, 30 g/1 sucrose, 1 g/1 casein 
5 hydrolysate, 2 mg/1 2,4-D, 300 mg/1 Timentin, 50 mg/1 Hygrornycin, 4 g/1 Gelrite, pH 
5.8) and place in the light at 30°C. All of the following steps occur under the same 
growth conditions (Light, 30°C). After three weeks the putative transgenic calli are 
transferred on a pre-regeneration media (N6 salts and vitamins, 30 g/1 sucrose, 1 g/1 
casein hydrolysate, 1 mg/1 2,4-D, 300 mg/1 Timentin, 50 mg/1 Hygrornycin, 6 g/1 Gelrite, 
) pH 5.8). After two weeks the good quality embryogenic calli is transferred to 
regeneration media (N6 Micro, / 2 N6 Macro, N6 vitamins, AA amino acids, 20 g/1 
sucrose, 1 g/1 casein hydrolysate, 0.2 mg/1 NAA, 1 mg/1 Kinetin, 50 mg/1 Hygrornycin, 
pH 5.8, Gelrite 6 g/1). After three weeks, plantlets that regenerate are subcultured to 
rooting media 04 MS salts, !4 B5 Vitamins, 10 g/1 sucrose, 25 mg/1 Hygrornycin, pH 5.8, 
8 g/1 microagar. After two weeks the plantlets that have robust root systems are 
transferred to soil (50% John Innes #3, 50% peat, 26°C, 16 hour photoperiod) and 
covered with a fleece until the plants are established. 

fLQ Analysis of carotenoids in rice transformants 

Carotenoids are extracted from seeds harvested at maturity. Seed is dehusked using a 
TR-200 Electromotion Rice Husker and then polished for 1 min. with a Pearlest polisher 
(Kett). Any white or discoloured seed are removed and 0.5g of the sample is ground for 
2 minutes using a Glen Creston 8000 Mixer/Mill. A total of lg of ground material/plant 
may be obtained by this method. This can be thoroughly mixed together prior to 
extraction of 2 x 0.5g portions of the powder. A standard compound can then be added to 
the samples for quantification of recoveries. Astaxanthin and echinenone are examples 
of standards which can be used. Samples are hydrated with 1ml of water and mixed 
using a vortex for a few seconds. 6ml of acetone is then added and the samples sonicated 
for 2 minutes. The samples are centrifuged for 5 min. at 3500 rpm. The supernatants are 
decanted, and the samples re-extracted twice more with 3 ml acetone, repeating the 
sonication/centrifugation steps between extractions. One extraction with 2 ml of tert- 
methylbutylether is then performed including the sonication/centrifugation steps and all 
supernatants for one sample pooled together. The total volume for each extract is 
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adjusted to 14 ml with acetone and the centrifugation step repeated. A 2ml aliquot of 
each sample is evaporated to dryness under a stream of nitrogen gas. These aliquots are 
re-dissolved in 75 |il of ethyl acetate, vortexed for 5-10 s and then transferred to amber 
HPLC insert vials. The vials are sealed immediately and centrifuged again at 3500 rpm 
5 prior to HPLC analysis. 

The HPLC quantification is based on the response factor determined for each reference 
standards. The overall concentration of the prepared and dissolved reference standards is 
measured spectrophotometrically based oil the published molar extinction coefficients 
and/or absorbance measured for solution of 1% concentration using 1cm path length 
10 (Al%lcm) (Ref: Britton G., Liaanen- Jensen S. and Pfander H. P. (1995) Carotenoids: 
Spectroscopy Vol IB pp57-62, Birkhauser Verlag, Basel, ISBN 3-7643-2909-2). For 
each standard stock solution, the purity of the principal component is determined by 
HPLC. For components where no reference standard is available e.g. various cis isomers 
quantitative results are expressed using the response factor for that of |5-carotene. 

15 

4. 1 HPLC Equipment & Conditions 



Pump 




Agilent 1 100 Quaternary or Binary Pump; 
model number ; G1311 A or G1312A, 
respectively 


Degasser 




Agilent 1100 Degasser; model number G1322A 


Temperature Controlled 
Autosampler 




Agilent 1100 Automatic Liquid Sampler; model 
number G1313A equipped with autosampler 
temperature controller model number G1330A 


Detector 




Agilent 1 100 Diode array detector; model 
number G1315A or G1315B 
Agilent 1100 fluorescence detector; model 
number G1321A (for tocopherol analysis only). 


Column Oven 




Agilent 1 100 Column Compartment; model 
number G1316A 
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Instrument Conditions 




Column 


: YMC C30 3 pn particles in 25 cm x 4.6 mm Id 
stainless steel column + 1cm x 4 mm 5pm 
YMC C30 guard column 


Column temperature 


25°C 


Sample temperature ; 


4°C 


Mobile phase : 


Solvent A = MeOH/H20/tert-butylmethylether 
(TBME) + 1.3mM NH4acetate (70/25/5 v/v) 
Solvent B - MeOH/H20/TBME + 1 JmM 
NH4acetate (7/3/90 v/v) 


Stop time 


30min 


Post time 


Omin 


Flow rate 


1 ml min-1 


Injection volume 


25 jj.1 in ethyl acetate 



Gradient conditions (6%/min): 



Time 


A% 


B% 


(min) 


MeOH/H20/TB 


MeOH/H20/TB 




ME +1.3mM 


ME + 1.3mM 




NH4acetate 


NH4acetate 




(70/25/5 v/v) 


(7/3/90 v/v) 


0 


95 


5 


15.83 


0 


100 


22 


0 


100 


24 


95 


5 


30 


95 


5 



Results of HPLC quantification of oarotenoids: 

Results for rice plants transformed with construct 1 1586 (construct described in 1.5 
above) 
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Sample 


ug/g dry weight (dwt) 


identity 


endosperm 


wiiu. iy|^e 


0 05 


11586-10 


4.19 


11586-7 


6.11 


11586-25 


6.32 


11586-15 


7.55 


11586-30 


7.69 


11586-28 


9.05 


11586-14 


11.82 


11586-20 


12.82 


11586-1 


13.29 


11586-12 


18.59 



Sequence ID Number 


Components 


pg /g dwt endosperm 


7 


crtl + pepper psy 


>3 


8 


crtl + tomato psy 


>3 


9 


crtl + crtB 


>5 




Untransformed Control 


0 



6.0 Transformation of rice - method used for transformation of rice with 
5 Agrobacterium comprising constructs 12421, 12422, 12423 and 12424 (constructs 
described in examples 1.6, 1.7, 1.8 and 1.9 respectively). 



For this example, rice (Oryza sativa) is used for generating transgenic plants. Various 
rice cultivars can be used (Hiei et al., 1994, Plant Journal 6:271-282; Dong et aL, 1996, 

10 Molecular Breeding 2:267-276; Hiei et al., 1997, Plant Molecular Biology, 35:205-218). 
Also, the various media constituents described below may be either varied in 
concentration or substituted. Embryogenic responses are initiated and/or cultures are 
established from mature embryos by culturing on MS-CIM medium (MS basal salts, 4.3 
g/liter; B5 vitamins (200 x), 5 ml/liter; Sucrose, 30 g/liter; proline, 500 rng/liter; 

15 glutamine, 500 mg/liter; casein hydrolysate, 300 mg/liter; 2 3 4-D (1 mg/ml), 2 ml/liter; 
adjust pH to 5.8 with 1 N KOH; Phytagel, 3 g/liter). Either mature embryos at the initial 
stages of culture response or established culture lines are inoculated and co-cultivated 
with the Agrobacterium strain LBA4404 containing the desired vector constaiction. 
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Agrobacterium is cultured from glycerol stocks on solid YPC medium (100 mg/L 
spectinomycin and any other appropriate antibiotic) for ~2 days at 28 °C. Agrobacterium 
is re-suspended in liquid MS-CM medium. The Agrobacterium culture is diluted to an 
OD600 of 0.2-0.3 and acetosyringone is added to a final concentration of 200 uM. 
5 Agrobacterium is induced with acetosyringone before mixing the solution with the rice 
cultures. For inoculation, the cultures are immersed in the bacterial suspension. The 
liquid bacterial suspension is removed and the inoculated cultures are placed on co- 
cultivation medium and incubated at 22°C for two days. The cultures are then 
transferred to MS-CIM medium with Ticarcillin (400 mg/liter) to inhibit the growth of 
10 Agrobacterium. For constructs utilizing the PMI selectable marker gene (Reed et al., In 
Vitro Cell. Dev. Biol.-Plant 37: 127-132), cultures are transferred to selection medium 
containing Mannose as a carbohydrate source (MS with 2%MannoSe, 300 mg/liter 
Ticarcillin) after 7 days, and cultured for 3-4 weeks in the dark. Resistant colonies are 
then transferred to regeneration induction medium (MS with no 2,4-D, 0.5 mg/liter IAA, 
15 1 mg/liter zeatin, 200 mg/liter timentin 2% Mannose and 3% Sorbitol) and grown in the 
dark for 14 days. Proliferating colonies are then transferred to another round of 
regeneration induction media and moved to the light growth room. Regenerated shoots 
are transferred to GA7-1 medium (MS with no hormones and 2% Sorbitol) for 2 weeks 
and then moved to the greenhouse when they are large enough and have adequate roots. 
) Plants are transplanted to soil in the greenhouse and grown to maturity. 

1£ Method of extraction/quantification of carotftnnirlg - f or plants transformed in 

accordance with Example 6.0 

(a) Sample is grinded in a Geno/Grinder at 1 600 rpm for. 40 seconds. 

(b) Representative amounts of homogenised sample (0. 1 g) are weighed into an 
appropriate extraction vessel (e.g. 2 ml microcentrifuge tube). Sample weight is recorded. 

(c) Samples are hydrated with water (200 fxl) and vortexed for about 2-3 seconds then left 
to stand for about 1 0 minutes. 

(d) Acetone (1 .2 ml) is added to the sample. 

(e) Ultrasonicate sample for 5 minutes. 

(f) Centrifuge sample for 3 min at 6000 rpm, then transfer the supernatant to another 
appropriately sized tube. 

(g) Steps (d) to (f) are repeated and combined with the previous extract. 
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(h) Steps (d) to (f) are repeated with tert-butylmethylether (400 jllI) and combine with the 
acetone extracts. 

(i) All of the combined extracts are transferred into an appropriate size tube and 
evaporate it to dryness under a stream of nitrogen gas. 

5 (j) Samples are redissolved in 2 ml of ethyl acetate + 0.5% BHT, vortex or ultrasonicate 
for 5-10 seconds and then centrifiiged at 3500 rpm for 5 minutes before HPLC analysis, 
(k) Absorption at wavelength 450 nm is measured on a spectrophotometer for all samples 
and total carotenoid concentration is calculated based on an e value of 124865. 

10 8.0 Results of the method of Example 7.0 



Construct 


Sample ID 


Calculated 
Carotenoids ftig/g) 


12421 


RIGQ2003 00 1 046 A4 A 


48.7 


12421 


RIGQ2003 00 1 063A43A 


• 42.7 


12421 


RIGQ2003001063A4A 


36.9 


12421 


RIGQ2003001063A99A 


31.4 


12421 


RIGQ2003001 063A59A 


30 


12421 


RIGQ2003 00 1 063A75A 


28.7 


12421 


RIGQ2003001048A3A 


28.5 


12421 


RIGQ2003001050A23A 


28 


12421 


RIGQ2003001050A13A 


26.8 


12421 


RIGQ2003001 048A25 A 


26.8 


12421 


RIGQ2003001048A61A 


26.1 


12421 


RIGQ2003001 049A46A 


23.4 


12421 


RIGQ2003000993A63A 


21.7 


12421 


RIGQ200300 1 049A43 A 


20.6 


12421 


RIGQ2003001049A26A 


10.5 


12421 


RJGQ2003001 050A1 8 A 


5.7 




Construct 


Sample ID 


Calculated 
Carotenoids (ng/g) 


12422 


RJGQ2003001045A30A 


58.8 


12422 


RIGQ2003001045A51A 


54 


12422 


RJGQ2003000995A29A 


43.6 


12422 


felGQ2003000995A3 1 A 


43.1 


12422 


RIGQ2OO30O1O43A10A 


42.4 


12422 


RIGQ2003001043A8A 


42 


12422 


RJGQ2003000995A19A 


41.9 


12422 


RJGQ2003000995 Al 7A 


40.9 
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Constru 


rt Sample ID 


Calculated 

Carotenoids (ng/e) 


12422 


RIGQ2003001060A23i 


\ 37.2 


12422 


RIGQ2003001052A56j 


\ 35 


12422 


RIGQ2003001051A75; 


\ 32 


12422 


RIGQ2003001045A68; 


V 30.6 


12422 


RIGQ2003001045A86> 


V 29.8 


12422 


RIGQ2003001045A49^ 


^ 29.2 


12422 


RJGQ2003001060A25/ 


k. 28.9 


12422 


RIGQ2003001051A64^ 


i 26.4 


12422 


RIGQ2003001051A34^ 


25.2 


12422 


RIGQ2003000994A7A 


22.8 


12422 


RIGQ2003001045A74A 


20.3 


12422 


RIGQ2003001051A46A 


16.6 


12422 


RIGQ2003000995A7A 


13.8 


12422 


RIGQ2003000995A26A 


8.1 


12422 


RJGQ2003000995A41A 


6.8 


12422 


RIGQ2003000995A8A 


3.8 




Construct 


Sample ID 


Calculated 
Carotenoids fu?/g) 


12423 ] 


WGQ2003001097A42A 


52.80 


12423 I 


UGQ2003001097A15A 


50.40 


12423 I 


UGQ2003001097A41A 


48.00 


12423 I 


UGQ2003001114A10A 


44.20 


12423 ; 


RIGQ2003001099A8A 


40.00 


12423 ] 


EUGQ2003001098A2A 


28.80 


12423 E 


UGQ2003001099A42A 


26.00 


12423 B 


JGQ2003001097A30A 


24.00 


12423 R 


JGQ2003001186A80A 


23.30 


12423 R 


JGQ2003001097A18A 


21.60 


12423 R 


IGQ2003001099A60A 


17.60 


12423 f 


UGQ2003001098A5A 


17.40 


12423 R 


IGQ2003001097A44A 


12.00 



Construct 


Sample ID 


Calculated 
Carotenoids (Ug/g) 


12424 


RJGQ2003001121A60Ai 


51.9 


12424 


RIGQ2003001121A20A 


r 51.8 


12424 


RIGQ2003001093A11A 


40 


12424 


RIGQ2003001094A85A 


37.8 ■ 


12424 


RJGQ2003001093A58A 


27.3 
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Construct 


Sample ID 


Calculated 

Carotenoids (jJtg/g) 


12424 


RIGQ2003001 1 1 8A84A 


26 



Although the invention has been described by way of the above-referenced examples and 
the Sequence Listing and Figures as provided herein, it will be apparent that 
modifications and changes may be practiced which remain within the ambit of the 
5 present invention. 
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X \^VjLJ\sL 
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CLAIMS 

1 . A polynucleotide comprising: 

(a) a region which comprises as operably linked components (i) a promoter which 
provides for seed preferred expression; and (ii) a nucleotide sequence derived 
from a bacterium which sequence encodes a carotene desaturase; and (iii) a 
transcription termination region; and 

(b) a further region which comprises as operably linked components (i) a 
promoter which provides for seed preferred expression; and (ii) a nucleotide 
sequence encoding a phytoene synthase which sequence is derived from maize 
(Zea sp.) or rice (Orzya sp.); and (iii) a transcription termination region. 

2. A polynucleotide according to claim 1 wherein the sequence which encodes the 
carotene desaturase is derived from Erwinia sp. 

3. A polynucleotide according to claim 1 or claim 2 wherein said promoter is 
selected from the Glutelin 1 promoter and the Prolamin promoter and said 
transcription termination region is selected from the Nos; CaMV 35S and PotPl- 
II transcription termination regions. 

4. A polynucleotide according to any one of claims 1 to 3 wherein the sequence 
which encodes carotene desaturase and the sequence which encodes phytoene 
synthase further comprises a sequence encoding a plastid targeting sequence. 

A polynucleotide according to any one of claims 1 to 4 wherein said region 
and/or said further region further comprises an intron. 

A polynucleotide according to any one of claims 1 to 5 which comprises a 
sequence selected from the group depicted as SEQ H) NO: 1; 2; 3; 4; and 6. 

A polynucleotide sequence which is the complement of one which hybridises to a 
polynucleotide according to claim 6 at a temperature of about 65°C in a solution 
containing 6 x SSC, 0.01% SDS and 0.25% skimmed milk powder, followed by 
rinsing at the same temperature in a solution containing 0.2 XSSC and 0.1% SDS 
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wherein said polynucleotide sequence still comprises a region encoding a 
carotene desaturase and a further region encoding a phytoene synthase and when 
said polynucleotide sequence is inserted into plant material the seed of a plant 
regenerated from said material produce an increased amount of carotenoids when 
compared to a control like-seed. 

8. A polynucleotide sequence according to claim 7 wherein when said 
polynucleotide sequence is inserted into plant material, the seed of a plant 
regenerated from said material produces at least a sixty fold increase in 
carotenoids when compared to a control like-seed. 

9. A polynucleotide sequence according to claim 7 wherein when said 
polynucleotide sequence is inserted into plant material, the seed of a plant 
regenerated from said material produces at least a three hundred and fifty fold 
increase in carotenoids when compared to a control like-seed. 

10. A polynucleotide sequence according to claim 7 wherein when said 
polynucleotide sequence is inserted into plant material the seed of a plant 
regenerated from said material produces carotenoids at a level of at least lOfig/g 
of endosperm of said seed. 

11. A polynucleotide sequence according to claim 7 wherein when said 
polynucleotide sequence is inserted into plant material the seed of a plant 
regenerated from said material produces carotenoids at a level of at least 1 5\xg/g 
of endosperm of said seed. 

12. A polynucleotide sequence which is the complement of one which hybridises to a 
polynucleotide according to claim 6 at a temperature of about 65°C in a solution 
containing 6 x SSC, 0.01% SDS and 0.25% skimmed milk powder, followed by 
rinsing at the same temperature in a solution containing 0.2 x SSC and 0.1% SDS 
wherein said polynucleotide sequence still comprises a region encoding a 
carotene desaturase and a further region encoding a phytoene synthase and when 
said polynucleotide sequence is inserted into plant material the seed of a plant 
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regenerated from said material produce carotenoids amounting to at least 80% of 
the carotenoid content of a seed which comprises a polynucleotide selected from 
the group depicted as SEQ ID NO: 1; 2; 3; 4; 5 and 6. 

13. A polynucleotide sequence according to claim 1 2 wherein v/hen said 
polynucleotide sequence is inserted into plant material the seed of a plant 
regenerated from said material produces carotenoids amounting to at least 100% 
of the carotenoid content of a seed which comprises a polynucleotide selected 
from the group depicted as SEQ ID NO: 1 ; 2; 3; 4; 5 and 6. 

14. A polynucleotide sequence according to any one of claims 7 to 1 3 wherein said 
seed is a rice seed. 

15. A polynucleotide or a polynucleotide sequence according to any one of claims 1 
to 14 which further comprises a region which encodes a selectable marker. 

1 6. A polynucleotide or a polynucleotide sequence according to claim 1 5 wherein 
said selectable marker comprises a mannose-6-phosphate isomerase gene. 

1 7. A polynucleotide or a polynucleotide sequence according to any one of claims 1 
to 1 6 which is codon optimised for expression in a particular plant species. 

1 8. A polynucleotide or a polynucleotide sequence according to claim 1 7 wherein 
said plant species is rice (Orzya sp.), 

1 9. A vector comprising a polynucleotide or a polynucleotide sequence according to 
anyone of claims 1 to 18. 

20. A method for increasing the carotenoid content of seeds comprising inserting into 
plant material a polynucleotide or a polynucleotide sequence according to any one 
of claims 1 to 18 or a vector according to claim 19; and regenerating a seed- 
containing plant from said material and identifying the seeds which contain 
carotenoids at levels greater than those of control like-seeds. 
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21. A method for increasing the carotenoid content of a seed comprising inserting 
into plant material a polynucleotide comprising a sequence selected from the 
group depicted as SEQ ID NO: 1; 2; 3; 4; 5 and 6 and regenerating a seed- 
containing plant from said material and identifying the seed which contains 
carotenoids at levels greater than those of a control like-seed. 

22. A method according to claim 20 or claim 21 wherein said seed contains at least a 
sixty fold increase in carotenoids when compared to a control like-seed. 

23. A method according to claim 22 wherein said seed contains at least a three 
hundred and fifty fold increase in carotenoids when compared to control like- 
seed. 

24. A method according to claim 20 or claim 21 wherein said seed contains 
carotenoids at a level of at least lOjig/g of endosperm of said seed. 

25. A method according to claim 24 wherein said seed contains carotenoids at a level 
of at least 15jig/g of endosperm of said seed. 

26. A method according to any one of claims 20 to 25 wherein said carotenoids are 
selected from the group consisting of: lycopene; alpha-carotene; lutein; beta- 
carotene; zeanthin; antheraxanthin; violaxanthin; and neoxanthin or a 
combination thereof 

27. A seed obtained by a method according to any one of claims 20 to 26. 

28. A seed according to claim 27 which is a rice seed. 

29. A plant which comprises a seed according to claim 27 or claim 23. 
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30. A plant or plant material which comprises a polynucleotide or a polynucleotide 
sequence according to any one of claims 1 to 18 or a vector according to claim 



19. 



5 31 . A plant or plant material according to claim 30 which is a rice plant or is rice 
plant material. 

32. A plant or plant material according to claim 30 which is a maize plant or is maize 
plant material. 



10 



33. A plant according to any one of claims 29 to 32 which further comprises a 
polynulceotide which provides for a trait selected from the group consisting of: 
insect resistance and/or tolerance; nematode resistance and/or tolerance; herbicide 
resistance and/or tolerance; improved resistance and/or tolerance to stress; a 

15 substance having pharmaceutical activity; and any other desired agronomic trait. 

34. Use of a polynucleotide or a polynucleotide sequence according to any one of 
claims 1 to 1 8 or a vector according to claim 19 in a method for the production-of 
seeds containing increased carotenoids. 



20 



35. Use of a polynucleotide selected from the group depicted as SEQ ID NO: 1 ; 2; 3; 
4; 5 and 6 for the production of a seed which contains carotenoids at levels 
greater that those of a control like-seed. 



25 36. 



Use of a polynucleotide or a polynucleotide sequence according to any one of 
claims 1 to 18 or a vector according to claim 19 in a method for the production of 
a plant comprising said polynucleotide, said polynucleotide sequence or said 
vector. 



30 37. 



Use of a polynucleotide selected from the group depicted as SEQ ID NO: 1; 2; 3; 
4; 5 and 6 in a method for the production of a plant comprising said 
polynucleotide. 
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Figure 
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Figure 2 
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10/549352 

XOSRec'd POTTO i 4 SEP 2flM' 



<110> SYNGENTA LIMITED 

<120> ENHANCED ACCUMULATION OF CAROTENO IDS IN PLANTS 

<130> 70237/WO 

<150> US60/457, 053 
<151> 2003-03-24 

<160> 38 

<170> Patentln version 3.1 ■ 

<210> 1 

<211> 5630 

<212> DNA 

<213> SYNTHETIC - 12423 



<400> 1 



gttaatcatg 


gtgtaggcaa 


cccaaataaa 


acaccaaaat 


atgcacaagg 


cagtttgttg 


60 


tattctgtag 


tacagacaaa 


actaaaagta 


atgaaagaag 


atgtggtgtt 


agaaaaggaa 


120 


acaatatcat 


gagtaatgtg 


tgagcattat 


gggaccacga 


aataaaaaga 


acattttgat 


180 


gagtcgtgta 


tcctcgatga 


gcctcaaaag 


ttctctcacc 


ccggataaga 


aacccttaag 


240 


caatgtgcaa 


agtttgcatt 


ctccactgac 


ataatgcaaa 


ataagatatc 


atcgatgaca 


300 


tagcaactca 


tgcatcatat 


catgcctctc 


tcaacctatt 


cattcctact 


catctacata 


360 


agtatcttca 


gctaaatgtt 


agaacataaa 


cccataagtc 


acgtttgatg 


agtattaggc 


420 


gtgacacatg 


acaaatcaca 


gactcaagca 


agataaagca 


aaatgatgtg 


tacataaaac 


480 


tccagagcta 


tatgtcatat 


tgcaaaaaga 


ggagagctta 


taagacaagg 


catgactcac 


540 
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aaaaattcat ttgcctttcg tgtcaaaaag aggagggctt tacattatcc atgtcatatt 

600 

gcaaaagaaa gagagaaaga acaacacaat gctgcgtcaa ttatacatat ctgtatgtcc 660 
atcattattc atccaccttt cgtgtaccac acttcatata tcatgagtca cttcatgtct 720 
ggacattaac aaactctatc ttaacattta gatgcaagag cctttatctc actataaatg 780 
cacgatgatt tctcattgtt tctcacaaaa agcattcagt tcattagtcc tacaacaacg 840 
aattcggctt cccgggtaca gggtaaattt ctagtttttc tccttcattt tcttggttag 900 
gacccttttc tctttttatt tttttgagct ttgatctttc tttaaactga tctatttttt 960 
aattgattgg ttatcgtgta aatattacat agctttaact gataatctga ttactttatt 1020 
tcgtgtgtct ttgatcatct tgatagttac agaaccgtcg actctagaga agccatttaa 1080 
atcgccgcca ccatggcttc tatgatatcc tcttccgctg tgacaacagt cagccgtgcc 1140 
tctagggggc aatccgccgc agtggctcca ttcggcggcc tcaaatccat gactggattc 12 00 
ccagtgaaga aggtcaacac tgacattact tccattacaa gcaatggtgg aagagtaaag 12 60 
tgcatgaaac caactacggt aattggtgca ggcttcggtg gcctggcact ggcaattcgt 1320 
ctacaagctg cggggatccc cgtcttactg cttgaacaac gtgataaacc cggcggtcgg 1380 
gcttatgtct acgaggatca ggggtttacc tttgatgcag gcccgacggt tatcaccgat 1440 
cccagtgcca ttgaagaact gtttgcactg gcaggaaaac agttaaaaga gtatgtcgaa 1500 
ctgctgccgg ttacgccgtf ttaccgcctg tgttgggagt cagggaaggt ctttaattac 1560 
gataacgatc aaacccggct cgaagcgcag attcagcagt ttaatccccg cgatgfccgaa 162 0 
ggttatcgtc agtttctgga ctattcacgc gcggtgttta aagaaggcta tctgaagctc 1680 
ggtactgtcc cttttttatc gttcagagac atgcttcgcg ccgcacctca actggcgaaa 1740 
ctgcaggcat ggagaagcgt ttacagtaag gttgccagtt acatcgaaga tgaacatctg 1800 
cgccaggcgt tttctttcca ctcgctgttg gtgggcggca atcccttcgc cacctcatcc 1860 
atttahacgt tgatacacgc gctggagcgt gagtggggcg tctggtttcc gcgtggcggc 1920 
accggcgcat tagttcaggg gatgataaag ctgtttcagg atctgggtgg cgaagtcgtg 1980 
ttaaacgcca gagtcagcca tatggaaacg acaggaaaca agattgaagc cgtgcattta 2040 
gaggacggtc gcaggttcct gacgcaagcc gtcgcgtcaa atgcagatgt ggttcatacc 2100 
tatcgcgacc tgttaagcca gcaccctgcc gcggttaagc agtccaacaa actgcagact 2160 
aagcgcatga gtaactctct gtttgtgctc tattttggtt tgaatcacca tcatgatcag 2220 
ctcgcgcatc acacggtttg tttcggcccg cgttaccgcg agctgattga cgaaattttt 2280 
aatcatgatg gcctcgcaga ggacttctca ctttatctgc acgcgccctg tgtcacggat 234 0 
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tcgtcactgg 


cgcctgaagg 


ttgcggcagt 


tactatgtgt 


tggcgccggt 


gccgcattta 


2400 


ggcaccgcga 


acctcgactg gacggttgag 


gggccaaaac 


tacgcgaccg 


tatttttgcg 


2460 


taccttgagc 


agcattacat 


gcctggctta 


cggagtcagc 


tggtcacgca 


ccggatgttt 


2520 


acgccgtttg 


attttcgcga 


ccagcttaat 


gcctatcatg 


gctcagcctt 


ttctgtggag 


2580 


cccgttctta 


cccagagcgc 


ctggtttcgg 


ccgcataacc 


gcgataaaac 


cattactaat 


2640 


ctctacctgg 


tcggcgcagg 


cacgcatccc 


ggcgcaggca 


• ttcctggcgt 


catcggctcg 


2700 


gcaaaagcga 


cagcaggttt 


gatgctggag 


gatctgattt 


gaggccatgc 


aggccgatcc 


2760 


ccgatcgttc 


aaacatttgg 


caataaagtt 


tcttaagatf gaatcctgtt 


gccggtcttg 


2820 


cgatgattat 


catataattt 


ctgttgaatt 


acgttaagca 


tgtaataatt 


aacatgtaat 


2880 


gcatgacgtt 


atttatgaga 


tgggttttta 


tgattagagt 


cccgcaatta 


tacatttaat 


2940 


acgcgataga 


aaacaaaata 


tagcgcgcaa 


actaggataa 


attatcgcgc 


gcggtgtcat 


3000 


ctatgttact 


agatcgggcc 


ttaataagct 


tgttaatcat 


ggtgtaggca 


acccaaataa 


3060 


aacaccaaaa 


tatgcacaag 


gcagtttgtt 


•gtattctgta 


gtacagacaa 


aactaaaagt 


3120 


aatgaaagaa 


gatgtggtgt 


tagaaaagga 


aacaatatca 


tgagtaatgt 


gtgagcatta 


3180 


tgggaccacg 


aaataaaaag 


aacattttga 


tgagtcgtgt 


atcctcgatg 


agcctcaaaa 


3240 


gfctctctcac 


cccggataag 


aaacccttaa 


gcaatgtgca 


aagtttgcat 


tctccactga 


3300 


cataatgcaa 


aataagatat 


catcgatgac 


atagcaactc 


atgcatcata 


tcatgcctct 


3360 


ctcaacctat 


4 

tcattcctac 


tcatctacat 


aagtatcttc 


agctaaatgt 


tagaacataa 


3420 


acccataagt 


cacgtttgat 


gagtattagg 


cgtgacacat 


gacaaatcac 


agactcaagc 


3480 


aagataaagc 


aaaatgatgt 


gbacataaaa 


ctccagagct 


atatgtcata 


ttgcaaaaag 


3540 


aggagagctt 


ataagacaag 


gcatgactca 


caaaaattca 


tttgcctttc 


gtgtcaaaaa 


3600 


gaggagggct 


ttacattatc 


catgtcatat 


tgcaaaagaa 


agagagaaag 


aacaacacaa 


3660 


tgctgcgtca 


attatacata 


tctgtatgtc 


catcattatt 


catccacctt 


tcgtgtacca 


3720 


cacttcatat 


atcatgagtc 


acttcatgtc 


tggacattaa' 


caaactctat 


cttaacattt 


3780 


agatgcaaga 


gcctttatct 


cactataaat 


gcacgatgat 


ttctcattgt 


ttctcacaaa 


3840 


aagcattcag 


ttcattagtc 


ctacaacaac 


gaattcggct 


tcccgggtac 


agggtaaatt 


3900 


tctagttttt 


ctccttcatt 


ttcttggtta 


ggaccctttt 


ctctttttat 


ttttttgagc 


3960 


tttgatcttt 


ctttaaactg 


atctattttt 


taattgattg 


gttatcgtgt 


aaatattaca 


4020 


tagctttaac 


tgataatctg 


attactttat 


ttcgtgtgtc 


tttgatcatc 


ttgatagtta 


4080 


cagaaccgtc gactctagag 


aagccattta 


aatcgccgcc 


accatggcca 


tcatactcgt 


4140 
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acgagcagcg tcgccggggc tctccgccgc cgacagcatc agccaccagg ggactctcca 4200 

gtgctccacc ctgctcaaga cgaagaggcc ggcggcgcgg cggtggatgc cctgctcgct 4260 

ccttggcctc cacccgtggg aggctggccg tccctccccc gccgtctact ccagcctgcc 4320 

cgtcaacccg gcgggagagg ccgtcgtctc gtccgagcag aaggtctacg acgtcgtgct 43 80 

caagcaggcc gcattgctca aacgccagct gcgcacgccg gtcctcgacg ccaggcccca 444 0 

ggacatggac atgccacgca acgggctcaa ggaagcctac gaccgctgcg gcgagatctg 4500 

tgaggagtat gccaagacgt tttacctcgg aactatgttg atgacagagg agcggcgccg 4560 

cgccatatgg gccatctatg tgtggtgtag gaggacagat gagcttgtag atgggccaaa 4620 

cgccaactac attacaccaa cagctttgga ccggtgggag aagagacttg aggatctgtt 4680 

cacgggacgt ccttacgaca tgcttgatgc cgctctctct gataccatct caaggttccc 4740 

catagacatt cagccattca gggacatgat tgaagggatg aggagtgatc ttaggaagac 4800 

aaggtataac aacttcgacg agctctacat gtactgctac tatgttgctg gaactgtcgg 4860 

gttaatgagc gtacctgtga tgggcatcgc aaccgagtct aaagcaacaa ctgaaagcgt 492 0 

atacagtgct gccttggctc tgggaattgc gaaccaactc acgaacatac tccgggatgt 4980 

tggagaggat gctagaagag gaaggatata tttaccacaa gatgagcttg cacaggcagg 5040 

gctctctgat gaggacatct tcaaaggggt cgtcacgaac cggtggagaa acttcatgaa 5100 

gaggcagatc aagagggcca ggatgttttt tgaggaggca gagagagggg taactgagct 5160 

ctcacaggct agcagatggc cagtatgggc ttccctgttg ttgtacaggc agatcctgga 522 0 

tgagatcgaa gccaacgact acaacaactt cacgaagagg gcgtatgttg gtaaagggaa 5280 

gaagttgcta gcacttcctg tggcatatgg aaaatcgcta ctgctcccat gttcattgag 5340 

aaatggccag acctagggcc atgcaggccg atccccgatc gttcaaacat ttggcaataa 5400 

agtttcttaa gattgaatcc tgttgccggt cttgcgatga ttatcatata atttctgttg 5460 

aattacgtta agcatgtaat aattaacatg taatgcatga cgttatttat gagatgggtt 5520 

ttfcatgatta gagtcccgca attatacatt taatacgcga tagaaaacaa aatatagcgc 5580 

gcaaactagg ataaattatc gcgcgcggtg tcatctatgt tactagatcg 5530 

<210> 2 
<211> 5630 
<212> DNA 

<213> SYNTHETIC - 12421 



